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ABSTRACT 


In  1965  the  U.  S.  Naval  Safety  Center,  Norfolk,  Virginia  forded  the  first  of  a 
series  of  studies  of  human  error  in  aircraft  accidents,  entitled  "Human  Error 
Research  and  Analysis  Program  (HERAP)  ."because  human  error  caused  accidents 
still  accounted  for  an  average  of  55%  of  the  total  and  constituted  a  secondary 

remain  tnq^5%,  after  twenty 


years  ’D'f'decTtne  irr'accidefi . 


The  program  began  with  a  detailed  systems  analysis  of  the  Navy  accident  report¬ 
ing  program  and  the  pilot  training  program.  With  the  cooperation  of  several 
departments  of  the  Navy  a  data  bank  was  prepared,  which  contained  personal 
data,  training  data  and  individual  flight  time  data  for  each  of  17,000  Navy 
pilots.  It  also  contained  all  mishap  data  from  1962  to  1968.  This  data  was 
merged  into  a  single  file  and  manipulated  by  various  statistical  proqrams 
developed  or  modified  for  HERAP.  An  Exposure  Index  was  developed  to  equalize 
Individual  exposure  to  risk. 

A  study  was  performed  to  establish  possible  relationships  between  a  pilot's 
mishap  record  and  his  education,  training,  demographic  characteristics,  and 
flying  experience,  and  to  test  the  distribution  of  mishaps  of  the  pilots  for 
randomness.  Each  pilot's  mishap  record  was  compared  to  the  cumulative  binomial 
distribution  to  establish  a  safety  rating  based  on  the  probability  that  he  should 
have  had  a  worse  mishap  record  by  chance  alone.  A  second  safety  rating  was 
developed  by  comparing  expected  and  actual  mishaps  for  each  pilot.  Using  a 
stepwise  regression  analysis,  a  study  was  made  of  the  functional  dependence  of 
the  safety  ratings  on  the  data.  The  highest  multiple  correlation  coefficient 
was  obtained  by  predicting  the  binomial  ranking  from  the  variables  age,  months 
of  combat,  last  year  of  combat,  rank,  and  type  flying  experience. 


catigue,  while  frequently  citec  as  a  contribution  factor  in  accidents  remains 
an  elusive  element  in  terms  of  definition,  recognition,  measurement,  prediction 
and  al leviation. 


Fatigue,  defined  as  "The  degradation  in  performance  or  affective  state,  resulting 
from  .previous  work,"  was  explored  in  1968  -  1969  in  a  complex  simulation  experi¬ 
ment  using  an  anti-submarine  warfare  flight  trainer.  Light  work  load  over  two 
nours  and  heavy  work  load  over  eight  hours  were  imposed  upon  four  professional 
•crews.  Each  crew  was  in  a  controlled  work  environment  24-hours  a  day  over  a 
2-week  period.  Psychological,  physiological,  biochemical,  and  performance 
measures  were  taken  for  individual  and  crew  as  a  group. 

Some  statistically  significant  effects  in  the  physiological  and  blood  chemistry 
measures  were  found.  Bartlett's  hypothesis  in  regard  to  increased  variability 
of  performance  and  perceptual  breakup,  and  the  activation  pattern  hypothesis 
were  to  some  extent  verified,  although  it  was  indicated  that  heavier  or  longer 
loads  would  be  needed  for  greater  significance. 


\ 


In  a  second  study  in  1969  -  1970.  a  state  of  fatique,  Induced  by  continuous 
performance  of  tasks  over  prolonged  working  time',  was  studied  in  a  laboratory 
environment.  The  tasks  were  selected  to  resemble  the  airborne  activities 
necessary  to  a  prolonged  ASW  mission. 


tha  t— ill  -  su  b s  i-ii  ^-  Uintv  -Wer-h— Hrad — -- 


(HWL)  group  worked  without  rest  for  approximately  18  hours  and  1?  different 
subjects  in  a  Low  Work  Lead  (LWl)  group  worked  for  the  same  time  span,  but  were 


permitted  equal  time  on  and  off  work,  such  that  their  workload  was  approximately 


or,e*half  that  of  tne  HWL  group.  Psychological,  physiological,  biochemical,  and 


performance  measures  were  taken  for  each  individual  subject. 


Performance  decrement  was  obtained  for  tne  HWL  reflecting  at  least  moderate 
fatigue.  A  specially  designed  Discrete  Tracking  Task  was  the  most  sensitive 
■index  of  fatigue.  The  task  was  moderately  difficult  with  almost  excessive  tempora  1 
demands.  The  hWL  group  also  made  larger  time  estimation  errors  late  in  the 
sessicr  than  the  LWL  group.  No  other  psychological .biochemical ,  or  physiological 
variable  was  correlated  with  performance  decrement  (Fatigue)  it'  this  study. 
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PART  I 

BACKGROUND  SUMMARY 
INTRODUCTION _ _ _ _ _ _ _ 


In  July,  1965  Douglas  Aircraft  Company,  tong  Beach,  California,  submitted  a  pro¬ 
posal  to  the  Navy  Safety  Center,  Norfolk,  Virginia,  for  a  short-term  investigation 
of,  and  planning  of,  a  long-term  study  of  an  Aircraft  Accidents  and  Development 
of  Accident  Prevention  Program.  The  study,  entitled,  "Human  Error  Research  and 
Analysis  Program  (HERAP)",  was  the  first  of  a  series  of  HERAP  studies  performed 
s'nce  that  time.  This  report  reviews  and  summarizes  those  studies. 

A  BRIEF  HISTORY  OF  NAVAL  AIRCRAFT  SAFETY 

In  the  first  few  years  of  Aviation  success  was  the  criterion  by  which  flights  were 
judged.  If  it  worked,  it  was  repeated  -  if  not,  changes  were  made.  This  same 
criterion  was  dominant  when  the  U.S.  Navy  formed  its  first  Aeronautical  Organiza¬ 
tion  under  Captain  Washington  Chambers  in  October  of  1910.  In  November,  1910 
Eugene  Ely  made  tne  first  take-off  from  the  deck  of  a  ship,  and  in  January,  1911 
landed  a  hydroplane  beside  a  ship,  was  hoisted  aboard,  lowered  into  the  water, 
and  took  off  to  successfully  return  to  shore. 

Safety  was  a  conscious  part  of  these  efforts  -  by  1912  Captain  Chambers  was  speak¬ 
ing  of  the  need  for  mechanical  devices  to  aid  the  pilot  "which  will  perform  .  ,  . 
the  things  he  is  prone  to  do  indifferently,  especially  under  the  strain  of  fatigue 
•  .  .".  In  that  same  year,  Captain  Chambers  brought  about  the  first  catapult 
launching  -  Naval  Aviation  was  on  its  way. 

By  1916,  efforts  were  well  under  way  to  standardize  personnel  and  aircraft  design, 
production,  inspection,  and  testing.  While  numerous  safety  orders  and  regulations 
were  issued,  one  essential  item  -  thorough  record  keeping  -  was  lagging.  The 
earliest  preserved  accident  record  was  a  5  k  6  card  written  on  one  side. 

An  Aviation  Medical  Research  Board  was  formed  in  1917  and  immediately  established 
a  Research  Laboratory  at  Hazelhurst  Field,  Mineola,  Long  Island.  In  1939,  the 
Navy  established  a  School  of  .Aviation  Medicine  and  Research  at  Pensacola,  Florida. 
Throughout  the  years,  from  the  end  of  the  World  War  I,  to  the  beginning  of  World 
War  II,  standards  for  the  selection  and  training  of  pilots  continued  to  improve. 
However,  Armstrong  (1952)*  states,  "The  increased  performance  of  aircraft 
attained  by  about  1930  resulted  in  a  situation  wherein  the  human  element  in 
flight  was  becoming  the  weakest  link  in  the  chain! "  A  remark  which  appears  to  be 
no  less  true  today. 

THE  BACKGROUND  FOR  THE  HERAP  STUDIES 

As  early  as  1955,  the  Navy  Safety  Center  had  requested  assistance  in  planning  for 
personnel  and  equipment  to  improve  the  analysis  of  Navy  Accident  Data.  The  result¬ 
ing  report  stimulated  further  efforts  by  Safety  Center  personnel  and  culminated 
in  1965  when  additional  personnel  and  equipment  were  authorized  and  the  HERAP 
effort  was  funded , 


(*)  As  quoted  in  HERAP,  First  Planning  Study,  UAC ,  1 96)0 


The  HERAP  effort  was  based  on  the  fact  that  while  the  general  trend  of  Navy  air¬ 
craft  accidents  was  downward,  the  percent  of  those  accidents  attributed  to  human 
error  was  consistently  greater  than  50*  of  the  total.  More  than  two-thirds  of 
the  Navy  aircraft  accidents  have  human  factors  listed  as  a  rnntHhijHng  cause. 

The  effort  expended  to  reduce  this  factor,  however,  has  been  historically  too 
little  in  proportion  to  Its  Importance.  The  effort  to  reduce  accl dentS_anjl^Lfiit— __ 
as  socl  a  ted  costs  was  given  added  impetus  wherthe'oMfiavy  ^cTSernTra  te  rose  1  n 
Fiscal  1966  after  many  years  of  annual  reduction. 

Reference  2  describes  the  change  as  fellows: 

"The  downward  trend  in  the  All  Navy  Accident  rate  was  interrupted  in  Fiscal  1966. 
While  flying  3,739,856  hours,  475  accidents  occurred  with  a  resulting  accident 
rate  of  1.27  compared  to  457  and  a  rate  of  1.25  in  Fiscal  1965.  Although  the 
rate  increase  was  very  small,  Fiscal  1966  was  the  first  year  since  the  Korean  War 
years  that  an  improvement  was  not  experienced  in  the  All  Navy  rate,  making  this 
the  third  time  in  twenty  years  that  the  accident  rate  has  increased  (the  last  two 
occurrences  were  in  Fiscal  Years  1951  and  1952).’’ 

The  primary  objectives  of  the  HERAP  Investigations  were: 

a.  to  identify  and  describe  those  characteristics  which 
differentiate  successful  pilots  from  unsuccessful  pilots 
in  terms  of  their  susceptibility  to  accidents  and  their 
contribution  to  fleet  readiness; 

b.  once  identified,  ic  order  these  characteristics  into  meaningful 
relationships  for  the  purpose  of  prediction  and  control. 

The  research  objectives  included  short,  intermediate  and  long-term  investigations 
leading  toward  the  ideal  goal  of  eliminating  aircraft  accidents. 

The  HERAP  Planning  Study 

The  first  HERAP  Planning  Study  investigated  information  sources  and  requirements 
within  the  Naval  establishment.  A  preliminary  mission-system- task  analysis  was 
performed  on  an  A-4  aircraft  ferry  mission  to  demonstrate  its  applicability  to 
human  error  accident  investigations.  Data  reporting  and  analysis  procedures  were 
reviewed  and  the  literature  on  accidents  was  reviewed.  A  program  for  the  develop¬ 
ment  of  Improved  data  gathering  was  proposed,  as  well  as  a  computer  system  adequate 
to  the  task  facing  the  Naval  Safety  Center  (NSC). 

One  of  the  critical  elements  of  the  Planning  study  was  an  analysis  of  resource 
requirements  sufficient  to  predict  measurable  gains  in  understanding  and  alleviat¬ 
ing  human  error  as  a  cause  of  accidents.  The  generalized  model  of  the  plan  in 
shown  in  Figure  1.  Also  recommended  were  five  critical  areas  of  concern  about  which 
sufficient  knowledge  is  lacking,  but  which  are  generally  believed  to  be  closely 
related  to  human  error.  Among  these  are  fatigue,  disorientation,  carrier  landing, 
collision  avoidance  and  an  index  of  exposure  for  flying  in  the  U.S.  Navy. 
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The  Philosophical  Foundations  of  HERAP  Research 

During  the  initial  HERAP  investigations,  in  which  many  activities  interested 
in  the  Naval  Aviation  safety  were  visited,  viewpoints  on  accident  reduction 
due  to  hurran  error  could  be  summarized  in  three  philosophical  approaches. 


The  first  was  one  which  states  that  a  method  to  reduce  the  frequency  of  mishaps 

is  to  detect  .and  reronvp  pprsnnnel  'Wha  far  nne  reason  nr  another  aro-  nnt — — - - 

proficient  enough  for  the  demanded  flying  tasks.  A  second  was  to  detect  and 
remove  the  equipment  and  procedural  stressors  which  in  some  way  caused  personnel 
to  fail  in  their  tasks.  A  third  was  that  problems  causing  mishaps  would  show  up 
in  data  in  an  empirical  fashion  and  special  studies  could  then  be  instigated  to 
deal  with  them.  Each  of  these  points  of  view  has  a  place  in  any  systematic 
program,  but  none  stands  on  Its  own  feet. 


In  the  first  case,  proficiency  is  a  function  of  innate  characteristics,  training 
and  task  difficulty.  If  selection  chooses  personnel  whose  capabilities  are 
adequate  to  be  modified  by  training  procedures  to  a  level  sufficient  for  the 
defined  task,  and  the  training  is  itself  adquately  structured  with  regard  to  the 
task,  detection  and  removal  of  personnel  for  lack  of  proficiency  will  apply 
only  to  those  who  are  not  performing  when  they  well  could;  that  is  we  are 
confining  ourselves  to  motivational  and  command  deviations.  On  the  other  hand,  if 
selection  and  training  are  not  adequately  matching  personnel  and  task,  the 
problem  of  urgency  is  not  only  detection  and  removal  of  personnel,  but  of 
restructuring  selection  and  training  procedures. 


In  the  second  case,  the  equipment  factor  follows  the  same  argument.  Equipment 
which  induces  operational  mishap  through  poor  design  should  most  certainly  be 
modified  and  replaced,  but  there  will  always  be  an  operator  who  can  use  it  without 
difficulty.  How  can  we  insure  that  equipment  is. designed  for  the  capabilities  of 
operators  without  knowing  something  about  expected  performance  of  our  population 
over  the  range  of  equipment? 


The  third  is  the  most  common  approach.  It  is  reflected  in  the  wide  variety  of 
data  categories  found  in  the  files  of  organizations  concerned  with  safety.  Cate¬ 
gories  are  based  on  detected  hazards  and  not  on  predicted  ones.  Where  this  is  true, 
the  range  of  variables  connected  with  a  particular  item  might  be  large  in  real 
life,  but  the  range  used  for  study  is  nevitably  small. 


The  approach  in  the  HERAP  studies  was  one  which  stated  that  we  should  establish 
knowledge  of  the  capabilities  cf  the  individual  and  grouped  personnel  in  regard  to 
the  current  and  predicted  tasks  which  they  must  perform.  For  personnel  already 
in  the  system,  data  existed  from  selection,  training,  readiness  and  operational 
sources  as  to  capabilities  of  certain  sorts.  It  was  possible  that  by  statistical 
study  we  might  relate  mishaps  to  these  measured  capabilities.  Some  of  this  work 
had  been  done  and  more  begun,  notably  at  Pensacola.  But  knowledge  of  capability 
in  regard  to  future  and  predicted  specific  tasks  had  not  been  carried  out  except 
in  isolated  studies.  Thus  we  attempted  to  isolate  the  tasks  in  some  systematic 
way,  update  these  over  the  long  term,  and  establish  levels  of  expected  performance 
for  them. 
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HUMAN  ERROR 


It  Is  of  little  use  to  attempt  the  description  of  a  lonq  range  accident  preven- 
t  IsrprggrsmTeTated'tO'WL'mee  err^r^wTtfiout^e^woHH-ng  »Hof4n  itf —  ^  — - 

human  error  is.  Like  many  terms,  this  one  has  a  stiqmatic  connotation 
related  to  "making  a  mistake"  and  this  is  unfortunate,  for  "making  a  mistake" 

Is  somewhat  a  dichotomous  notion  assuming  that  there  are  only  two  conditions 
of  consideration,  right  and  wrong. 

Because,  until  now.  the  primary  effort  in  human  error  research  has  been  an 
effort  to  aid  in  the  design  of  reliable  systems,  it  has  been  concentrated  in 
the  areas  of  system  design  and  development.  Studies  by  Cooper,  1961;  Meister, 

1962;  Willis,  1962  gave  impetus  to  the  consideration  of  the  impact  of  human 
error  on  system  performance.  It  thus  became  necessary  to  measure  the  operators 
potentiality  for  failure  (that  is,  his  potentiality  for  error)  so  that  this 
could  be  combined  with  those  of  other  system  components,  to  give  a  true 
indication  of  total  system  reliability.  Consequently,  human  error  has  been 
defined  within  the  framework  of  system  design  and  development,  due  to  these 
utility  goals. 

Human  error  in  the  usual  context  of  the  operator,  is  necessarily  a  wide  all 
inclusive  term  which  emphasizes  the  frailty  of  the  operator  without  recognizing 
that  there  is  a  "human  system"  which  can  fail.  The  human  system  failure  is 
pinpointed  when  the  operator  does  not  complete,  successfully,  trie  required 
operation.  The  "human  system"  complexity  has  stymied  many  studies,  all  gen¬ 
eralized  solutions,  and  made  significant  progress  in  corrective  actions  very  slow. 

Human  error  in  this  context,  has  been  defined  in  terms  of  four  categories: 

1.  Incorrect. performance  of  a  required  action 

2.  Failure  to  perform  the  required  action 

3.  Out-of-seauence  performance  of  a  required  action 

4.  Performance  of  a  non-reaui red  action. 

These  are  actually  cat  gories  of  mishaps  and  are  constrained  by  one 
qualification:  the  error  must  have  some  significant  effect  upon  the  system 
as  a  whole  or  upon  some  behavioral  component  of  the  system  or  it  must  be 
considered  as  not  significant.  Further,  "reversible"  errors  corrected 
prior  to  system  damage  are  not  significant.  Human  Error,  in  this  context, 
then,  has  been  defined  as  any  deviation  from  a  system  performance 
standard  which  is  caused  directly  or  indirectly  by  an  operator  and  which 
has  significant  consequences  to  the  system  operation  in  which  it  was 
made . 


Uk 

used 


in  system  design 


1c  it  is  worthwhile  to  take  the  categorical  definitions  of  human  error 
to  work  with  in  HERAP  so  that  a  continuity  of 

that^HE  RAP--  is-cone  e*wwd- wH-H- - 

opera ti ora  1  systems  in  oeing  and  both  "reversible"  and  "active"  error 
must  be  considered.  A  reversible  error  today  may  be  the  seed  of  tomorrow's 
accident,  as  in  a  near  miss.  Human  error  research  in  systems  design 
is  concerned  with  quantification  in  a  manner  similar  to  equipment,  so 
that  reliability  estimates  for  design  can  be  made.  In  the  prevention  of 
accidents  in  the  operational  phase  of  a  system,  some  other  categories  must 
also  be  considered. 


Some  current  Methods  of  Human  Error  Analysis  and  Predictability  have 
been  presented  by: 

1 .  Altman  et  al ,  1964 

2.  Swain  and  Rook  (1962  and  1963) 

3.  Miller  et  al ,  1964,  and 

4.  Irwin  et  al ,  1964. 

All  but  Irwin's  are  based  upon  Altman's  "A.I.R.  Data  Store."  This  store 
was  probably  the  first  attempt  to  manipulate  performance  data  for  use 
in  the  development  of  an  operability  index  for  prediction  of  human  perfor¬ 
mance.  As  Altman  himself  points  out,  it  is  generally  highly  limited,  but 
for  those  specific  tasks  listed,  it  is  considered  to  be  excellent.  The 
operability  index  was  built  with  three  objectives: 

1.  It  would  be  task  oriented. 

2.  Scores  from  the  operability  index  would  be  directly  meaningful  in 
terms  of  time  and  error  of  performance  (speed  and  reliability). 

3.  Data  on  speed  and  reliability  of  operator  performance  would  be 
obtained,  insofar  as  possible,  from  available  experimental  and 
field  data. 


The  basic  units  for  evaluation  are  the  task  elements,  whic^  are  defined  as 
inputs,  mediating  processes,  and  outputs. 

The  method  of  using  the  data  store  was  as  follows; 

1.  Equipment  design  information  and  data  obtained  from  task 

analyses  must  be  analyzed  into  behavioral  steps  and  sequenced 
by  mission  phases . 
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2.  EaCn  upcrdLiOi'i  must  be  Subdivided  to  'is  relevant  components, 
parameters,  and  dimensions  for  each  behavioral  step. 


3.  Each  of  these  dimensions  most  be  assigned  a  reliability  estimate 

from  the  data  store.  These  estimates  are  then  used  to  derive  scores 
for  units  of  behavior,  such  as  steps,  tasks,  phases  and  entire 
mi ssions . 

In  general,  time  scores  were  the  sum  of  individual  time  estimates  for 
eacn  dimension,  while  reliability  scores  were  the  product  of  individual 
reliability  estimates  for  each  dimension.  The  total  output  was  a  reliability 
score  which  was  an  estimate  of  the  probability  that  serious  operator  error 
would  not  occur  within  tne  span  of  performance  encompassed  by  the  score. 

THERP  (Technique  for  Human  Error  Rate  Predictions),  was  an  extended 
utilization  of  Altman's  Data  Store  by  Swain  and  Rook  in  the  form  of 
an  iterative  procedure  of  5  steps  which  were  repeated,  (though  not 
always  in  the  same  order),  until  tne  system  degradation  resulting  from 
human  error  was  at  an  acceptable  level.  These  five  steps  were: 

1.  Define  the  system  or  subsystem  failure  which  is  to  be  evaluated. 

2.  Identify  and  list  all  the  human  operations  performed  and  their 
relationships  to  system  tasks  and  functions. 

3.  Predict  error  rates  for  each  human  operation  or  group  of 
operations  pertinent  to  the  evaluation. 

4.  Determine  the  effect  of  human  errors  on  the  system 

3.  Recommend  changes  as  necessary  to  reduce  the  system  or 
subsystem  failure  rate  as  a  consequence  of  the  estimated 
effects  of  the  recommended  changes. 

As  Swain  points  out,  the  steps  are  typical  of  the  usual  syster  reliability 
study,  if  one  substitutes  Humans  for  Hardware." 

A  third  method,  developed  by  Miller  employed  a  computer  technique  to  generate 
distributions  of  operator  performance  times.  Tasx  performance  is  simulated 
by  sampling  the  distrioution  of  times,  using  Monte  Carlo  methods,  during 
10  computer  runs. 

Miller's  approach  to  operator  simulation  was  based  on  four  assumptions ; 

1.  The  human  transfer  function  depends  upon  the  individual's 

capability,  his  operating  environment,  and  the  nature  of  the 
task  requirements.  Therefore,  to  describe  the  operational 
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behavior  of  a  man-machine  system  req  'es  the  simulation  of 
groups  of  operators  and  maintenance  personnel  of  varying  capability^ 

_ denrgg  of  t^ainlng  and’  level  of  mot  i va  ttony  spcrQtl'ng'urrdfef  'tMfe  “ 

range  of  variability  of  conditions  and  stresses  encountered 
in  the  operation  environment, 

2.  Individuals  tend  to  perform  in  a  nonlinear  fashion.  In  deriving 
the  model  varying  paths  of  task  accomplishment  were  provided,  the 
number  of  paths  being  a  function  of  all  possible  alternatives  of 
task  execution  available  to  the  operator. 

3.  The  performance  of  a  subtask  by  operators  classified  by  skill 
level  can  be  described  by  time  and  success  probability  distribu¬ 
tions.  and  these  distributions  aoproach  normality.  The  assumption 
of  normality  is  not  a  limiting  factor  but  merely  one  of  convenience 

4.  The  path  of  task  accomplishment,  the  performance  times,  and 

the  success  probabilities  were  dependent  on  psychological  stress 
induced  by  the  operating  environment,  task  accomplishment,  and  work 
load,  and  this  stress  could  be  an  organizing  or  disorganizing  agent 
■behavior,  deoending  on  stress  level  and  operator  stress  threshold. 

The  method  developed  by  Irwin  was  one  in  which  judges  ratings  were  used 
to  develop  human  reliability  data  estimates  when  other  data  (such  as  the 
data  store),  were  not  available.  His  method  was  as  follows: 


1.  Specify  tasks  that  must  be  performed. 

2.  Identify  the  task  elements  necessary  to  accomplish  the  total  task. 

3.  Determine  the  performance  reliability  for  each  task  element. 


4.  Perform  a  preliminary  rating  study. 


Perform  a  man- rating  study. 


Though  the  use  of  a  rating  scale  may  seem  to  te  the  crudest  form  of 
elemental  psychophysics,  work  by  Whitlock  indicates  that  a  proper  choice 
of  judges  (more  than  6)  utilized,  and  no  more  than  40  items  to  be  rated 
results  in  surprisingly  high  validity  when  tested. 


Ore  of  the  major  problem  areas  for  HERAP ,  which  is  less  for  system 
design  predi ctabi 1 i ty ,  is  tne  consideration  of  additional  data  that  may  be 
pertinent  to  human  error  prevention  where  mishaps  are  concerned.  A 
partial  listing  from  Kock  and  from  Sells  of  some  of  the  stimulus  variables 
to  be  considered  are: 


on 


1.  Natural  aspects  of  the  environment: 


Weather-temperature ,  humidity,  seasons,  climate 

Terrain-alti tude ,  erosion 

Food 

- Clothing - 

Natural  resources 

2.  Description  of  the  task  situation: 

Area  and  level  of  knowledge  and  skills  required 
Hazards  and  risks  involved 

Novelty  of  situation  in  relation  to  prior  experience 

Procedures  permitted 

Information  required  and  available 

Number  of  participants  present  and  available 

Material  and  facilities  required  and  available 

Degree  of  personal  contact  involved 

Role  expectations  of  other  persons  concerning  the  individual 

3.  Definition  of  the  individuals  relation  to  the  situation: 

Degree  of  freedom  vs.  restriction  in  group  activities 
Degree  of  competion  vs.  cooperation  required 
Degree  of  friendliness  vs.  hostility  required 
Status  hierarchy  position  required 

4.  Description  of  other  persons  in  the  situation: 

Age  and  sex  of  participants 

Social  and  economic  background  of  participants 

Skills,  abilities,  and  experience 

Training  and  motivation 

New  or  previous  acquaintances 

Pre-existing  relationships 

5.  External  reference  characteristics  of  the  individual: 

Biological  -  sex,  age,  physique 
Physical  abnormalities  or  injuries 
Race 

Education 
Marital  status 

Citizenship,  legal  restraints 

Geographic  position-rural,  urban,  national 

Social  status 

Income  and  occupation 

Residence  and  transportation 

Debts  and  savings 

Status  and  role  in  family  group 

Parents  -  religion,  age,  health,  language 
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Siblings  -  age,  sex,  type 
Primary  group  and  children 

Group  memuersrn  p$  -  mjiirije r ,  type  str  uctur  e 
Status  and  role  in  group 


6.  Individuals  performing  relative  to  others; 

Formal  group  structure  -  goals,  objectives,  and  facilities 

Membership  requirements  -  age,  sex,  race 

Requirements  concerning  experience,  training,  achievement 

Group's  control  of  members  -  SOP's,  regulations 

Group  procedures  -  meetings,  staffing 

Dependency/cooperation  of  other  groups 

Formal  role  in  group  •  prestige,  power,  privilege 

Site  requirement  of  group  -  location,  space  facilities 

Informal  group  structure  -  goals  and  objectives 

Group  member  ship  requirements  -  social  class,  demography 

Group  control  of  membership 

Regulations  of  group  procedures  -  activities,  participation 
The  group's  significance  to  its  members 
Cohesion  of  the  group,  functional  relationships 
Collective  situations  -  crowds,  mobs,  audience 

This  partial  list  indicates  the  great  number  and  range  of  stimuli  that  are 
suggested  to  be  relevant. 

If  the  term  “human  error"  is  to  survive  over  a  long  term  program,  then 
a  more  precise  definition  is  required  which  reduces  value  judgments  to 
their  proper  place  as  adjuncts  to  objective  measures  which  can  effectively 
be  used. 

The  definition  proposed  for  this  purpose  was  "Human  error  is  the  specified 
deviation  of  a  human  activity  or  decision  from  an  op~eTat~ionalTy  defined  norm. 

This  says,  in  effect,  that  an  error  cannot  occur  unless  limits  have  been 
set  before  a  mishap  occurs.  This  is  a  necessary  condition  for  setting  up 
any  methodology  for  recording,  manipulating  and  acting  upon  data  in  a 
meaningful  way.  In  this  way,  too,  it  can  be  argued  that  although  the 
operational  definitions  may  be  modified  and  their  number  increased  with 
time,  no  groundless  judgments  can  be  made. 

In  Naval  aviation,  there  are  many  circumstances  where  mishaps  occur  in 
conditions  where  it  is  perfectly  clear  that  the  human  operator  has  had  no 
chance  to  mitigate  or  even  influence  the  result.  In  these  cases,  there  is 
no  question  of  immediate  mistakes,  although  relationship  to  a  remote, 
prior  influence  may  be  traceable.  It  is  a  truism  that  in  any  system  failure 
the  failure  may  be  ultimately  traced  to  a  human  source.  If  there  is  a 
wing  root  failure  or  an  equivalent  physiological  catastrophe,  we  may 
consider  these  conditions  unmodi fiable  by  the  human  operator. 


r-in 


circumstances  which  are  more  difficult  to 


.  A  *v.  s  w,  * _ _ 
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Between  these  limits  are 

describe  and  define.  Considerably  iiiOre  unto, 
are,  and  have  been,  involved  in  the  study  of  equipment  failure  compared  with 

- -human,  -failure,  "— J-his--iS— not ...surpr-isJng,— for.  me.th.Qd$._o.f  .me.a.s,u.reme.nt.r, - 

definable  variables  and  access,  each  provide  greater  ease  of  manipulation 
in  this  area.  Vet,  as  we  probe  deeper  into  the  equipment  failure  concepts, 
we  find  that  here,  too,  dichotomous  judgments  are  not  acceptable  in  most 
cases.  The  concept  of  "tolerance"  states  that  an 
and  should  have  been  produced  and  tested,  so  that 
to  its  function  in  such  a  way  that  failure  should 
tolerance  limits  are  exceeded.  If  the  item  fails 
specified  and  required  to  do,  it  may  have  done  so 
meet  the  tolerance  limits  in  its  quality  or  that  these  limits  have  been 
exceeded  in  operation  in  such  a  way  that  it  is  probable  that  it  would  fail. 
This,  again,  may  be  related  to  the  conditions  that  the  tolerances  have 
not  been  set  correctly  for  the  function  required  to  be  performed,  or  that 
the  item  has  not  been  tested  to  determine  if  it  does  fall  within  the  limits 
set.  It  is  clear  that  here  there  is  a  range  of  performance  capability  set 
up  which  allows  some  operational  freedom  in  the  use  of  the  item.  One 
may  also  expect  that  this  range  may  be  described  in  a  distributive 
manner,  that  is,  the  specified  quality  or  qualities  can  be  shown,  over 
large  samples,  to  fit  some  descriptive  transfer  function,  such  as  a 
Gaussian  or  similar  relationship. 


item  has  been  desiqned 
its  performance  Is  related 
not  occur  unless  the 
to  perform  as  it  is 
because  either  it  does  not 


Furthermore,  in  equipment  parlance,  there  also  occurs  the  concept  of 
expected  failure,  related  to  use  over  time.  In  this,  a  historic  or  empirical 
approach  will  result  in  probability  judgments  that  there  is  such  and  such  a 
"mean-time-to-fai lure"  for  this  item,  or  some  similar  predictive  state¬ 
ment.  Here,  items  may  be  replaced  before  some  value  of  the  distribution 
of  this  parameter  is  reached  on  the  assumption  that  actual  failure  of  the 
item  will  be  reduced.  Alternatively,  inspections  may  be  made  at  specific 
points  along  this  distribution  in  order  to  search  for  indications  that  an 
item  is  approaching  tolerance  limits  with  time  and  use. 

If  we  are  to  consider  the  effects  of  the  human  operator  within  the  system, 
it  is  necessary  to  interface  with  the  methods  and  descriptions  used  in 
dealing  with  the  equipment  he  uses,  for  few  mishaps  occur  within  the 
"unmodifiable"  category  described  earlier  and  there  is  very  little  that  can 
be  done  about  them  within  our  terms  of  reference.  On  the  other  hand,  the 
greater  part  of  our  study  must  fall  within  the  Interface  area  where  both 
operators  and  equipment  are  contributing  to  activity  and  decision  within 
a  system. 


In  summarizing  the  equipment  "approach",  however,  there  is  little  to 
disagree  with,  and  nothing  which  contradicts  our  proposed  definition  of 
human  error.  Qualities  in  operators  can  be  shown  to  be  distributed 
across  samples,  to  have  tolerances  which  can  be  exceeded  and  result^  in 

#>  _  u  -  i.  ■  «  «*  m  vs  *  M  knf  n  V’O  a  Htirinil  CAfl  [)  H  fTllSfc 

Tdliure,  uail  uc  LBStcu  anu  injpcvwt.w  ^  ~  ^  ~  - -  ' ~  .  .  , 

matched"  where  the  use  does  not  correspond  to  the  qua  1 ity  selected  and 
specified.  This,  of  course,  does  not  imply  that  we  should  treat  human i 
operators  in  tiie  same  way  that  we  do  equipment,  there  are  oiner  ciesi.nn.or 
and  qualities  which  will  not  fall  within  these  contexts  (such  as  motiva¬ 
tional  factors),  but  we  can  assure  ourselves  that  comiiona  1 1  ty  over  much 
of  our  study  area  can  be  assured  for  the  purposes  of  prediction,  descrip¬ 
tion  and  diagnosis,  with  equipment  methods,  allowing  our  interface  to 
occur  without  too  much  difficulty. 


In  some  recent  studies  of  "human  error",  notably  in  highly  structured 
weapon  systems,  it  has  become  common  to  attach  criticality  values  to  error 
incidence.  That  is,  it  is  recognized  that  an  "error"  (in  the  loose  sense) 
can  occur  in  any  of  n  ci rcumstances ♦  but  that  in  some  circumstances  it 
does  not  matter  and  Tn  others  it  will  matter  a  great  deal.  Thus,  error 
orobaci 1 i ty  is,  in  these  approaches,  linked  to  a  level  of  criticality.  In 
the  context  of  everyday  military  aviation,  we  may  illustrate  this  by  stat¬ 
ing  tnat  there  is  a  known  and  finite  probability  of  misreading  a  specific 
altimeter,  by  a  specific  population  of  pilots,  with  an  error  of  1000  feet. 
At  higher  altitudes  it  is  unlikely  that  this  will  be  "critical"  (except 
under  special  circumstances),  but  at  1000  feet  altitude  it  certainly  will. 
From  this  point  of  view,  it  is  necessary  to  relate  the  error  to  the 
activity  in  the  operational  context;  that  is  in  the  "equipment"  context, 
the  "tolerance"  of  the  quality  of  Instrument  reading  caoability  could  be 
variable.  It  is,  however,  easy  to  be  confused  with  the  worth  of  this 
conceot.  In  a  highly  structured  weapon  system  wnere  tnrre  are  probably 
nary  operators,  and  these  can  be  selected  and  trained  and  utilized  in 
different  ways,  it  may  well  be  valuable  to  introduce  a  criticality  factor, 
in  military  aviation,  where  numbers  are  fewer,  where  training  and  selec¬ 
tion  are  common  to  tiie  population,  and  where  the  pilot,  for  example,  has 
consiaeraLie  freedom  in  the  operation  of  his  vehicle  system,  there  may 
oe  less  value.  In  this  case,  "error"  must  be  reduced  to  a  minimum,  for 
all  occasions,  because  it  is  not  always  known  when  it  might  or  might  not 
be  critical.  A  further  point,  especially  in  this  example,  is  that  the 
pilot  may  not  (and  more  often  than  not,  does  not)  know  that  he  has  misread 
unless  something  results  from  the  misreading.  The  attachment  of  criti¬ 
cality  measures  tnen ,  should  be  considered  with  care  in  the  context  of 
a v i  ation. 


Our  approach  tnen  was  to  attempt  to  specify  what  we  expected  of  our  human 
operators  in  precise  qualities  which  tney  must  possess  to  carry  out 
specific  tasks  in  specific  systems.  Only  in  tnis  way  could  be  determine 
wnen  something  unusual  had  happened.  Tnere  are  various  ways  of  setting 
aoout  this.  Unfortunately,  we  were  not  beginning  from-  the  start  of  an 
organization,  we  were  forced  to  aeal  wi  tn  an  existing,  hignly  formalized 
structure.  Thus,  we  were  necessarily  empirical  in  much  the  same  way  that 
equipment  interests  are.  rlhat  are  the  distributions  of  qualities  among 
the  existing  population  of  aviators  and  other  associated  personnel  in 
regard  to  the  tasks  they  now  nave  to  perform? 
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"Error"  was  examined  then,  from  the  point  of  view  of  a  deviation  from  what 
could  be  expected,  but  blame  was  not  necessarily  attached  to  the  individual 
for  the  deviation. 

Once  again,  commonality  between  equipment  failure  analyses  and  human  error 
ansiycoc  techniques  should  be  borne  ’ r  m'pd.  tm«  rndtunrul  it.y  should  permit, 
relatively  easy  integration  of  a  larger  human  error  and  existing  equipment 
failure  and  safety  program. _ 


THE  HERAP  STUDIES 

As  a  result  of  the  HERAP  Planning  Study  several  areas  of  safety  research 
related  to  human  error  were  recomnended  as  especially  suitable  for 
immediate  investigation.  They  included  fatigue,  and  accident  exposure  index 
and  carrier  landings.  The  results  of  those  studies  are  described  elsewhere 
in  th;s  report. 

The  first  planning  study  [HERAP  I)*  included  a  representative  task  analysis 
with  comments  and  recommendations  for  further  uses  of  this  method  as  an 
analytical  tool.  HERAP  II  included  an  extensive  task  analysis  of  an  A-4 
mission  and  beginning  development  of  data  handling  and  analysis. 

HERAP  III  included  an  extensive  study  of  the  relationship  of  flight  Dhase  to 
accidents.  Task  analyses  of  representative  U.  S.  Navy/U.  S.  Marine  Corps  A-4 
aircraft  missions  were  made.  The  missions  were  divided  into  static-incident 
to  flight,  taxi,  takeoff,  inflight  and  landing  phases.  The  landing  and 
takeoff  chases  each  have  separate  analyses  made  for  carrier  and  field  operations. 

Computer  tabulations  were  made  of  A-4  accidents  and  incidents  for  fiscal  year 
1967.  The  mishaps  were  tabulated  to  graphically  indicate  the  number  of  accidents 
and  incidents  which  occurred  during  each  phase  of  flight.  Analyses  were  made 
for  the  A-4  A/5/C  models  and  for  the  A-4  E/P  models.  Comparisons  were  made 
of  the  numbers  of  mishaps  occurring  for  tne  various  flight  phases.  This  work 
is  summarized  in  Appendix  A,  Figures  1  through  8. 

The  causes  of  mishaps  are  summarized  under  field  and  carrier  operations 
and  under  flight  phases.  Further  analyses  were  made  of  pilot  error  causes  of 
mishaps  for  both  carrier  and  field  operations  during  landing. 

A  task  analyses  form  was  developed  to  show  graDhically  the  phases  of  flight 
which  have  high  mishap  potential.  Results  of  the  analyses  show  that  the 
greatest  number  of  mishaps  occur  during  the  landing  phase.  Carrier  operations 
show  a  higher  rate  of  mishaps  than  field  operations  which  is  a  generally 
accepted  fact.  A-4  aircraft  accidents  are  summar  .zed  in  Tables  I  and  II. 


*The  HERAP  studies  nave  arbitrarily  been  assigned  number  I,  II,  III,  and  IV 
to  differentiate  between  contracts  as  follows: 


HERAP  I 
HERAP  II 
HERAP  III 
HERAP  IV 


Contract  N189-(188)-59791A 
Contract  N001 89-67-00320 
Contract  N00189-68-G0565 
Contract  N001 89-70-C- 1 321 


8/65  to  2/66 
9/66  to  9/67 
10/68  to  3/69 
10/69  to  4/70 
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COMPARISON  OF  PILOT  ERROR  CAUSE  TO  OTHER  CAUSES  OF  MISHAPS 
FOR  THE  OVERALL  FLIGHT  PHASES  AND  FOR 

tffNtTNG  PHASE  StTA'RATflT"  . 

A4  AIRCRAFT 


FIELD  OPERATIONS: 

PILOT  ERROR 

OTHER  CAUSES  * 

ACCIDENTS 

24 

41 

36 . 9% 

63.  IS 

INCIDENTS 

110 

330 

25% 

75% 

TOTALS: 

65 

100?, 

440 

100% 

CARRIER  OPERATIONS: 

PILOT  ERROR 

27 

20.45:1 

27 

18.49% 

other  causes 

105 

79 . 552 

119 

81  .51% 

TOTALS: 

132 

100% 

146 

100% 

FIELD  PLUS  CARRIER 

OPERATIONS: 

PILOT  ERROR 

51 

25.89? 

137 

23.38% 

QrHER  CAUSES 

146 

74.11% 

449 

76.62% 

TOTALS : 

197 

1 00% 

586 

100% 

LANDING  PHASE  - 
FIELD  OPERATIONS 

PILOT  ERROR 

7 

50% 

25 

39.84% 

OTHER  CAUSES 

7 

50% 

38 

60.16% 

TOTALS: 

14 

100% 

63 

100% 

LANDING  PHASE  - 
CARRIER  OPERATIONS 

PILOT  ERROR 

19 

54.29% 

9 

34.61 7 

OTHER  CAUSES 

1 6 

45.71% 

17 

65.39% 

TOTALS: 

35 

100% 

26 

1 00% 

Other  Cause  Factors  include  combat,  airbase/carrier  facilities,  other  personnel 
(maintenance/supervisory) ,  material  failure/malfunction,  weather,  unavoidable  a 
undetermi ned/other  causes. 
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TABLE  II 


COMPARISONS  OF  ACCIDENTS  AND  INCIDENTS  BY  FLIGHT  PHASES 


ft  A  ft  )Tt  /  /I-  it-  »  *  n 

n/ua/L/r  MirtiKHri 

- 1 

U 

!  1 
FIELD  OPERATIONS,  _ _ |  OM1£B-OEE«AUONS~-- - 

FLIGHT  PHASES 

ACC  ID 

* 

INCID 

- - 

ACCID 

V 

INCID 

% 

Part  A: 

“  ].  '  STATIC 

0 

0 

9 

2  A 

0 

3 

2% 

2.  TAX! 

1 

1  .  S': 

1 1 

2.5" 

* 

1.5"' 

3 

2% 

3.  TARE  OFF 

5 

3- 

36 

ST 

8 

6.02% 

15 

10% 

A.  INFLIGHT  * 

42 

65.5" 

316 

7  20 

87 

65.41% 

99 

68% 

5.  LANDING 

14 

22% 

63 

14.3* 

35 

26.32% 

26 

18% 

c.  wAVE  OFF 

0 

0 

3 

0.7% 

0 

0 

0 

0 

S.  LET  DOWN 

2 

3% 

2 

0.5% 

1 

0.75% 

0 

0 

TOTALS  FLIGHT  PHASES: 

64 

1 00  D 

440 

100% 

133 

1 00% 

• 

146 

100% 

Pari  E : 

Net  Incident  To  Flight  and 
L'naetemi  ned  Pnases  Added: 

9.  NOT  INCIDENT  TO  FLIGHT 

48?: 

4 

0.88;. 

23 

15% 

1 

0.66% 

C,  UNDETERMINED 

0.8% 

12 

2.6% 

1 

0.6% 

L _ 

5 

3.3% 

TOTALS  ALL  PHASES: 

122 

- 

loot: 

456 

152 

100% 

*  Part  Cl 

INFLIGHT  DIVIDED  BETWEEN: 

A.  ENEMY  ACTION,  AND, 

15 

33.7  % 

87 

21% 

68 

36 

36% 

b.  NOT  ENEMY  ACTION 

27 

66.3?? 

229 

73% 

19 

63 

64% 

TOTALS  INFLIGHT: 

42 

1  COT 

316 

1 OO'”- 

87 

TOOL 

99 

100% 

*  INFLIGHT  MISHAPS,  PART  A-4,  HAS  BEEN  FURTHER  SUBDIVIDED  IN  PART  C,  A,  AND  B. 
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Fatigue  la  frequently  cited  as  a  major  contributor  to  accidents.  The  wide 

a  C  Cep  to  nee  of  this  a  S  5  u  'Tie  u  relationship  IS  nut  Stirprislny.  *luSt  peuple 

have  experienced  "fatigue"  more  than  once,  and  subjective  feelings  of  drowsl- 
ness,,  apathy.,  weakness,  etc. .  certainly  "ought"  to.  be  precursor,  so  f_ _ _ _ 


accidents . 


Operational  requirements  frequently  make  it  necessary  or  desirable  to  pro¬ 
gram  extended  and/or  very  frequent  activities.  There  is  undoubtedly  a  limit 
to  the  workload  that  men  can  assume  without  seriously  influencing  the  safety 
and/or  probability  of  successful  completion  of  the  assigned  task. 

It  would  be  desirable  to  supply  medical  personnel  and  leaders  with  valid  and 
reliable  guidelines  of  performance  changes,  as  well  as  predictors  or  symptoms 
of  these  changes  accruing  for  a  given  workload.  Attempts  to  define  the 
effects  of  fatigue  under  controlled  conditions,  however,  have  been  generally 
unsuccessful . 

A mono  tne  factors  that  have  probably  contributed  to  the  lack  of  definitive 
results  obtained  in  studies  of  fatigue  are  the  following: 

1.  Complexity  of  the  human  organism,  with  its  ability  to  draw  upon  extensive 
resources . 

c.  Vagaries  of  motivation  and  Dhysical  condition. 

3.  Effects  of  age  and  training. 

4.  Similarity  of  the  symptoms  of  fatigue  and  those  of  illness. 

5.  difficulty  of  separating  the  effects-  of  other  types  of  stress  from  fatigue 

In  order  to  delimit  the  scope  of  the  problem,  fatigue,  for  the  purpose  of 
tnis  study,  was  defined  operationally  as  "the  degradation  in  performance  or 
affective  state  resulting  from  previous  work."  The  purpose  of  this  study 
was  to  initiate  a  systematic  program  intended  to  ultimately  achieve  the 
fol lowing  goal s: 

1.  Determine  performance  parameters  (in  tasks  representative  of  those 
required  of  Navy  personnel),  influenced  by  continued  heavy  workload. 

2.  Determine  the  workload  which  results  in  the  onset  of  a  degradation  in 
oerformance . 

3.  Determine  the  magnitude  of  impairment  for  two  selected  workloads. 

4.  OeFine  physiological  and/or  behavioral  indices  of  the  onset  of  fatigue. 

5.  Develop  an  understanding  of  the  underlying  mechanisms  involved  in  per¬ 
formance  degradation. 


6.  Discover  methods  of  mitigating  or  alleviating  degradation  without 
adverse  side  effects. 


Concl usions 


1.  The  subjects  were  fatigued  only  slightly  by  the  conditions  encountered 
in  the  high  WL/treatment . 

2.  No  mean  performance  changes,  resul ted  in  any  performance  measure. 


3.  Pilot  tracking  data  supported  the  increased  variability  and  perceptual 
breakup  hypothese . 

4.  Physiological  measures  generally  reflected  changes  in  workload.  Particu 
larly  promising  are  respiration  rate  and  sleep  recovery  data. 

a.  The  pattern  hypothesis  was  supported  but  cannot  be  used  indiscriminately 
the  most  affected  parameters  must  be  determined  for  the  combined  data 
treatment  to  be  effective. 

£.  The  phorias  are  worth  investigating  in  greater  depth  as  indices  of 
fa  t i gue . 

Recommendations 

1.  A  parametric  study  of  work  load  should  be  conducted  to  determine  the 
amount  or  the  duration  of  work  that  can  be  performed  before  serious 
performance  degradation  results. 

2.  Control  over  the  complete  situation  for  basic  studies  would  be  easier 
in  a  ncn-operational  situation.  Therefore,  serious  consideration 
should  be  given  to  conducting  additional  exploratory  studies  using  non- 
ooerational  or,  at  least  less  complex,  tasks. 

3.  Ar  inflight  study  should  be  performed  to  verify  the  6-hour  mission 
duration  acceptability  for  tactical  ASW  crews  in  aircraft  such  as  the 
S-2. 
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PART  II 


MISHAP  DATA  ANALYSIS 


INTRODUCTION 

A  considerable  body  of  data  describing  the  personal  histories  and  flying  exper¬ 
ience  of  Navy  pilots  has  been  built  up  over  the  past  years.  These  data  range 
over  a  wide  variety  of  factors  which  could  conceivably  be  related  to  how  safely 
pilots  perform.  The  main  thrust  of  the  HERAP  data  analysis  program  for  the 
1969  -  1970  contract  was  to  systematically  test  the  existing  data  for  significant 
relationships  with  the  criterion  of  "pilot  safety." 

Linear  regression  (linear  least-squares  curve  fitting)  was  used  to  test  for 
these  relationships.  The  variables  on  the  data  bank  were  used  to  try  to  predict 
a  numerical  measure  of  flying  safety  for  each  pilot.  The  error  in  the  predic¬ 
tions  were  then  analyzed;  that  is,  the  discrepancies  between  predicted  and 
Observed  pilot  safety  were  analyzed.  If  there  were  no  error,  the  predictions 
would  be  perfect  and  it  could  be  confidently  said  that  there  was  a  relationship 
between  pilot  safety  and  at  least  some  of  the  variables  on  the  data  bank.  If 
the  error  in  the  predictions  turned  out  to  be  as  great  as  the  intrinsic  scat¬ 
ter  in  the  observed  pilot  safety  measure,  it  could  be  said  with  equal  confi¬ 
dence  that  no  such  relationship  existed.  The  actual  situation  is  almost  always 
somewhere  between  these  two  extremes,  but  statistical  techniques  can  he  employed 
to  allow  some  statement  about  the  probability  of  a  relationship  existing  between 
the  measure  of  pilot  safety  and  the  data  on  individual  pilots. 

This  analysis  proceeded  in  three  successive  steps:  development  of  a  data  bank, 
development  of  a  measure  of  pilot  safety,  and  the  actual  regression  analyses. 
These  are  discussed  in  the  following  sections. 

THE  DATA  BANK  SYSTEM 

In  the  years  of  its  existence,  the  Naval  Safety  Center  has  collected  an  enormous 
amount  of  data.  These  data  have  served  as  the  source  of  numerous  studies.  As 
the  data  have  grown,  so  have  the  problems  of  using  them.  The  sheer  quantity 
had  made  comparative  studies  a  near  impossibility  unless  the  study  utilized  a 
sampling  technique  to  limit  the  data  management  problem. 

Douglas  Aircraft  Company  began  the  data  analysis  for  HERAP  by  investigating 
and  listing  all  the  data  sources  available  during  1967  (HERAP,  2nd  Quarterly 
Progress  Report).  From  this,  a  number  of  sources  were  selected.  They  were: 


1. 

Naval  Safety  Center 

Mishap  Master  File 

2. 

Naval  Safety  Center 

Individual  Flying  Time  Record 

3. 

Naval  Training  Center 

Training  Records 

4. 

Bureau  of  Naval  Personnel 

Personal  Data 

Data  files  on  computer  tape  were  obtained  from  these  sources,  translated  to 
make  them  compatible  with  Douglas'  computers  and  merged  into  a  single  data 
bank  after  editing. 


Best 
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Copy 


The  data  system  is  used  on  an  IBM  360/65  computer.  The  statistical  programs 
are  basically  the  University  of  California  at  Los  Angeles  Bio-medical  Programs 
(BMD).  Cobol  is  the  programming  language  for  file  maintenance  and  report 
generator  programs,  and  Fortran  IV  is  the  language  used  for  callable  selected 
mathematical  subroutines.  Details  of  the  operation  of  the  HERAP  data  bank 
and  statistical  programs  are  fully  reported  in:  Human  Error  Research  and 
Analysis  Proaram  (HERAP  II)  Final  Report.  Douglas  Aircraft  Company,  September 
1967. 

Data  gathering,  translation,  and  editing  lias  continued  during  all  the  HERAP 
contracts.  However,  the  major  task  in  development  of  the  data  bank  has  been 
the  selection,  translation,  modification,  and  testing  of  suitable  statistical 
programs.  This  was  completed  in  1967.  The  merged  master  data  bank  was  comple 

ted  in  1968  and  tested  against  a  dummy  data  file. 

During  the  current  contract,  the  whole  system  has  been  fully  utilized.  The 
system  was  used  to  study  the  backgrounds,  flying  records,  training  records, 
and  mishap  reports  for  17,436  Navy  pilots  active  during  1967.  The  data  were 
obtained  from  four  files:  the  Officer  Master  Tape,  the  Individual  Flight  Time 
Records,  the  Training  Record,  and  the  Mishap  Master  File. 

The  data  bank  (Figure  3  )  was  constructed  by  matching  the  other  files  to  the 

records  on  the  1967  Officer  Master  Tape  by  means  of  pilot  file  numbers.  The 
Officer  Master  Tape  (OMT)  contains  demographic  data,  records  of  advancement 
through  rank  and  of  past  and  present  duty  stations,  information  on  promotional 
status,  and  summaries  of  flying  experience. 


HERAP  DATA  BANK 


Total  Records 

17,436 

IFTR  Records 

10,598 

Training  Records 

4,374 

Mishap  Records 

895 

Figure  3. 

The  Individual  Flight  Time  Records  (IFTR)  report  the  number  of  flights  and 
flying  hours  in  1967  for  each  pilot,  broken  down  by  type  of  activity  and  typa 
of  aircraft.  These  were  available  for  only  10,598  of  the  pilots  on  the  OMT. 

The  Training  Record  describes  in  terms  of  scores  and  grades  the  pilot's 
progress  through  the  Pilot  Training  Curriculum  from  start  to  finish. 

The  Mishap  Master  File  (MMF)  is  a  data  bank  on  which  is  coded  all  of  the  data 
collected  in  the  course  of  a  mishap  investigation.  The  term  "mishap"  includes 
comparatively  minor  accidents  occurring  on  the  ground.  When  the  data  is 
keypunched.  Navy  Safety  Center  Personnel  do  not  keypunch  pilot  identification 
unless  the  mishap  is  judged  to  be  at  least  partially  attributable  to  the  pilot 
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Therefore,  since  the  HERAP  data  bank  was  built  up  by  matching  pilot  file 
numbers,  it  was  impossible  to  use  most  of  the  records  on  the  MMF .  From 
approximately  5,000  mishap  reports,  it  was  possible  to  match  only  895  "pilot 
error"  mishaps.  All  mishap  rates  used  in  this  study  were  calculated  on  the 
basis  of  these  895  mishaps.  Some  tabulations  were  made  to  further  determine 
th<»  type  of  mishaps  on  the  HERAP  data  bank,  as  shown  below.  Figure  4. 


TABULATION  OF  MISHAP.  PARAMETERS 


Aircraft  Damage 

"  of  Total 

Extent  of  Xnj.uJ7.  ! 

of  Total 

Total  Loss 

30 

Fatal ,  Missing 

10 

Substantial 

9 

Major 

5 

Minor 

14 

Minor 

12 

limi  ted 

17 

None 

74 

'lone 

Not  Recorted 

26 

5 

101 

W 

Due  to  Enemy  Action 

%  of  Total 

First  Accident  Type  “ 

of  Total 

Yes 

23 

Collision  -  Aircraft 

4 

No 

77 

Collision  -  Ground/Water 

10 

Too- 

Collision  -  Other 
Stall/Spin 

38 

1 

Type  Operation 

7c  of  Total 

Airframe  Failure 

7 

Fire/ Explosion 

3 

fclf/fmlp 

1 

Abandoned  Aircraft 

4 

C/  Operation/LPH 

47 

Landing  Mishap 

22 

Field  Operation 

36 

Other 

n 

Oea  Operation 

GCA/  IL; 

Ship  Operation 

Carrie^  Oual ificat’ons 

1 

3 

5 

4 

5 

102 

100 

Figure  4. 

Since  the  data  bank  occupied  three  reels  of  computer  tape  and  required  eight 
minutes  to  read,  a  smaller  data  bank  was  developed.  It  consisted  of  2,054 
randomly  chosen  records .  The  records  on  this  tape  were  shortened  by  one 
naif  by  dropping  fields  which  were  usually  blank  or  otherwise  unusable.  Most 
cf  the  statistical  analyses  was  done  with  this  data  bank. 


The  Exposure  Index  and  a  Measure  of  Pilot  Safety 


It  was  necessary  to  move  from  vague  Intuitive  concepts  of  what  constitutes 
a  ':safe::  or  an  "unsafe"  pilot  to  a  numerical  definition  of  pilot  safety. 

To  be  useful  in  the  search  for  relationships  with  the  wide  range  of  data 
measure -of—  p-» .  ot  saftfty-inrd-to-vary  contrtnuou's  t y  from 
"very  safe"  tnrougn  "average"  to  "very  unsafe".  Assuming  that  a  "safe" 
pilot  is  one  w'io  doesn't  have  an  excessive  number  of  mishaps  for  the 
amount  and  circumstances  of  the  flying  he  does*  two  different  techniques 
of  rating  pilots  were  developed.  "Either  of  these  rating  techniques  is 
just  a  mathematical  statement  of  an  intuitive  definition  of  pilot  safety. 

before-  discussing  the  pilot  rating  techniques,  it  is  necessary  to  consider 
the  interaction  between  pilot  safety  and  the  immediate  environment  of  the 
pilot  during  his  flight.  Parameters  suen  as  weather,  phase  of  flight, 
type  of  aircraft,  day  or  night,  etc.,  will  obviously  influence  the  proba¬ 
bility  t.oat  a  pilot  will  suffer  a  mishap.  The  purpose  of  the  human  error 
research  program  is  to  isolate  safety-rel ated  factors  intrinsic  to  the 
pilot;  factors  which  will  cause  some  pilots  to  consistently  have  more 
! ■  i sh aps  tnan  other  pilots  regardless  of  the  type  of  flying  they  do.  With 
the  goal  of  casing  the  pilot  safety  rating  free  of  the  effects  of  the 
environmental  variables,  an  exoosure  index  was  developed.  The  exposure 
index  was  originally  conceived  as  a  means  of  ranking  squadrons  in  terms 
■;f  safety,  but  it  became  clear  during  HERA?  II  tnat  tne  study  of  indepen¬ 
dent  pilot  safety  would  require  some  adjustment  for  exposure. 

Tne  casic  concept  behind  tne  exposure  index  is  tnat  pilots  flying  under 
the  same  conditions  should  be  expected  to  nave  mishaps  at  the  average  rate 
for  all  cl  lots.  The  rating  systems  compare  eacn  pilot's  actual  perfor¬ 
ate  To  this  exDected  Derformance.  In  orcer  to  calculate  expected 
performance  as  accurately  as  possiole,  it  is  necessary  to  find  tne  mean 
misnap  rates  within  each  possible  set  o f  flying  conditions.  To  find  these 
rates  requires  accurate  data  on  the  number  of  mishaps  and  the  number  of 
Mights  under  eacn  set  of  conditions.  Inis  oata  is  certainly  not  available 
since  no  two  flignts  are  under  the  exact  same  conditions,  so  lumping  into 
categories  is  required. 

uata  was  available  to  calculate  tne  mi  snap  rates  only  wnen  broken  down  by 
aircraft  type,  type  of  operation,  and  day  or  night.  The  Misnap  Master 
File  was  also  supposed  to  contain  complete  information  on  several  other 
environmental  factors  for  which  allowance  mignt  have  been  made,  but  these 
data  fields  as  shown  below  were  found  to  be  over  fifty  percent  blanks  and 
could  not  be  used.  There  is  no  data  on  the  number  of  flights  under  the 
various  conditions  described  by  these  fields,  so  that  at  best  the  data 
would  have  nad  to  come  from  some  inaccurate  guesses.  The  exposure  index 
could  oe  improved  if  the  missing  data  were  available. 
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Sample  of  data  fields  from  the  Mishap  Master  File  which  contained  over 
fifty  percent  blanks: 

Data  Percent  Blank 


Kind  of  flight 

85 

Clearance 

91 

Wind  velocity 

95 

Alti tude 

70 

Maneuver  prior  to  emergency 

97 

The  final  exposure  index  divided  the  types  of  flying  into  only  twelve 
categories  broken  down  on  the  basis  of  three  variables  (Figure  5).  The 
aircraft  types  were  grouped  into  three  categories  of  hazardousness  on 
the  basis  of  published  mishap  rates^.  Type  of  operation  was  divided 
into  "carrier  activity"  and  "non-carrier  activity".  These  combinations 
were  further  broken  down  into  "day"  and  "night"  flying.  The  overall 
mishap  rate  was  calculated  for  each  of  these  categories*  and  the  pilots 
were  rated  on  the  basis  of  their  performance  relative  to  these  standards. 


FLYtNG  CATEGORIES  AND  MISHAP  PROBABILITIES 
(PER  ID-4  LANDINGS) 


_ 
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; 

i 
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1 

U.  7 
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Figure  6 . 


1.  U.  S.  Navy  Aircraft  Accident  Statistics,  Fiscal  Year  1967  (Confidential) 


RANKING  OF  PILOTS  BY  SAFETY  RECORDS 


The  first  ranking  method  was  to  obtain  a  quantitative  answer  to  the  question, 
"What  is  the  probability  that,  by  chance  alone,  a  pilot  should  have  had  more 
mishaps  than  he  actually  had?"  Within  each  category  of  flying,  the  value  of 
the  cumulative  binomial  distribution  was  calculated,  where 


Cb(x)  -  £  B(x) 

x+ 1 


_ n ;  . 

x: (N-x) : 


given  x  -  number  of  mishaps  the  pilot  had  within  the  category 

N  =  number  of  landings  the  pilot  had  within  the  category 

p  3  average  mishap  probability  for  the  category  (from 
Figure  3) 

q  -  1-p 

\ 

A  value  of  CB(x)  was  calculated  for  each  category  in  which  the  pilot  flew, 
and  these  were  then  averaged  to  give  his  "binomial  ranking".  Figure  6 
shows  the  frequency  distribution  of  the  binomial  rankings.  Most  binomial 
rankings  are  nearly  zero,  meaning  that  for  most  pilots,  the  probability 
was  very  low  that  they  should  have  had  more  than  the  number  of  mishaps 
which  they  actually  had.  This  is  to  be  expected;  most  pilots  had  no 
mishaps  but  didn't  fTy  enough  to  really  be  expected  to  have  a  mishap. 

The  highest  binomial  ranking  value  was  only  .28. 

The  second  approach  to  ranking  pilots  was  based  on  the  question,  "How 
does  a  pilot's  mishap  rate  (mishaps  per  10*  landings)  compare  to  the  mishap 
rates  of  all  other  pilots.  An  arbitrary  formula  was  devised  which  has  the 
properties : 

(1)  Increases  in  value  with  an  increasing  number  of  expected 
mishaps , 

(2)  Decreases  in  value  with  an  increasing  number  of  mishaps, 

(3)  Equals  zero  if  a  pilot's  actual  number  of  mishaps  is  exactly 
equal  to  his  expected  number  of  mishaps. 

"Expected  number  of  mishaps"  is  defined  as  the  sum  over  all  flying  categories 
of  the  pilot's  actual  number  of  landings  multiplied  by  the  average  mishap 
rate  from  Figure  6,  i.e.: 


12 

Expected  Mishaps  =  SI  L.  x  p. 

L  =  1  1  1 


where,  for  each  pilot, 

L.  =  number  of  landings  in  flying  category  i 
p.  =  mishap  probability  for  flying  category  i 
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The  arbitrary  function  comparing  expected  to  actual  mishaps,  called  the 
"x-ranking" ,  was  calculated  for  each  pilot  (Figure  6).  Each  pilot's 
percentile  ranking  on  the  distribution  of  x-rankings  was  then  calculated 
(Figure  7).  This  percentile  ranking  was  used  as  the  variable  indicating 
how  a  pilot's  safety  record  compared  to  those  of  other  pilots. 

These  two  ranking  procedures  were  applied  to  the  condensed  data  Dank  of 
2,054  random, ly  cnosen  pilots.  There  were  sufficient  data  to  rank  only 
1,133.  of  these  pilots. - - - - - - - - 

Regression  Analysis 

An  extensive  analysis  was  conducted  to  determine  if  the  variables  In  the 
condensed  data  bank  could  be  used  to  predict  the  binomial  rankings  or  the 
percentile  rankings.  A  stepwise  regression  program  was  used  for  most  of 
this  work.  This  program  builds  up  a  linear  regression  equation  from  a  set. 
of  candidate  variables  by  successively  inserting  tne  variable  with  the 
nignest  probability  of  significantly  improving  the  fit  of  the  equation. 
Variables  from  the  Officer  .Master  Tape  and  the  Training  Record 'were  used 
as  candidate  variables.  Predictive  equations  were  developed  for  the  binomial 
ranking  and  tne  percentile  ranking  (Figure  8).  Meitner  of  these  equations 
is  really  satisfactory.  The  equation  predicting  binomial  ranking  lias  a 
multiple  correlation  coefficient  of  only  .58;  tne  other  equation  has  an  even 
lower  correlation  coefficient. 

In  the  equation  predicting  tne  binomial  rankings  the  last  year  of  combat 
code  has  a  correlation  coefficient  with  the  binomial  rankings  of  .52.  This 
ore  variable  accounts  *or  almost  all  of  the  predictive  ability  of  the 
equation.  Similarly,  date  of  birth  and  percentile  ranking  have  a  correla¬ 
tion  coefficient  of  .37,  accounting  for  most  or  the  predictive  ability  in 
the  seccna  equation. 

In  fcctn  equations,  a  group  of  variables  appear  which  relate  to  age  or  experi¬ 
ence.  This  group  includes  birtn  year,  rank,  total  flight  hours,  date  of 
birth,  and  fl  ignt  nours  for  the  past  five  years.  In  each  equation,  the 
overall  relationship  is  a  slight  decrease  in  a  pilot's  ranking  for  an 
increase  in  his  age  or  experience. 

The  variable  "last  year  of  combat"  was  set  to  zero  for  pilots  with  no  com¬ 
bat  experience,  so  that  it  owes  some  of  its  predictive  ability  to  being 
able  to  cistinguish  between  pilots  who  had  or  had  not  been  assigned  to 
combat.  Its  correlation  with  the  rankings  both  with  and  without  these 
zero  values  (Figure  9)  was  calculated  in  an  attempt  to  gauge  how  important 
the  zeros  are.  No  significant  change  was  noted. 

Comparison  of  Mishap  Distribution  to  a  Poisson  Distribution 

The  distribution  of  misnaps  among  the  population  of  pilots  was  tested  for 
randomness.  It  is  often  stipulated^  that  the  distribution  of  frequencies 


1.  Stinson,  P.  J.  and  Walsh,  J.  E.  (1965)  "An  Application  of  the  Poisson 
Adp^ox imation  to  Naval  Aviation  Accident  Data",  presented  at  the 
35th  Session  of  the  Internati onal  Statistical  Institute,  Belgrad. 
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PREDICTIVE  EQUATIONS 


1.  Binomial  Ranking  ■  -4,703 

<  1  I?  .  >  /  I..'  t.iU  1  .  .  k  A.  .  ..  J  A-  -  \ 

*r  ,  iuj  a  |uit  in  y  t:  cl  r  ,  iq^i  l  vtw  uiui  i .■*  j 

+  .005  x  (VTA  grade) 

-  ,C0C  x  (VT3  grade) 

*  .332  x  (rank  code) 

-  .261  x  (months  of  combat) 

*1,027  x  (last  year  combat,  last  digits) 

+  .079  x  (college  level  code) 

-  .166  x  (highest  award  code) 

-  .0001  x(total  flight  hours) 

R  -  .58.  R2  «  ,34 

2.  Percentile  Rankirg  -  -1.808 

■*  .00005  x  (date  of  birth,  days  since  1900) 

-  .024  x  (months  of  combat) 

+  .044  x  (last  year  of  combat,  last  digits) 

*  .008  x  (college  level  code) 

+  .00004  x  (flight  hours  last  five  years) 

R  »  .42,  P2  *  .17 


FIGURE  9 


CORRELATION  COEFFICIENTS  FOR 
"LAST-YEAR-  OF-COMBAT" 


WITH 

WITHOUT 

ZEROES 
(1133  cases) 

ZEROES 

(156  cases) 

Correlation  wi th 

Binomial  Rankings 

.52 

.47 

Correlation  wi th 

Percentile  Rankings 

.20 

.27 

FIGURE  in. 
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of  randomly  occurring  accidents  is  approximated  by  the  Poisson  distribution. 
The  observed  distribution  of  numbers  of  mishaps  for  the  total  pilot  popula¬ 
tion  was  used  to  first  calculate  the  mean  number  of  mishaps  for  all  pilots 
and  was  then  compared  to  the  Poisson  distribution  having  this  same  mean. 

The  theoretical  and  observed  frequencies  for'  each  number  of  mishaps  are 
presented  in  the  following  table. 


Tiu“cfr —  — 

Observed 

bxpec  It’d 

Mishaps 

F  requency 

f  requency 

0 

10541 

10456 

1 

1 

703 

850 

*> 

b7 

22 

3 

14 

0.4 

4 

L 

0 

-5 

0 

0 

6 

0 

0 

/ 

Q 

0 

a 

1 

0 

It  's  obvious  that  the  fit  is  not  a  good  one.  The  cni -squared  test  of  the 
di screpancies  cannot  oe  accounted  for  by  sampling  error. 

Histograms  giving  results  of  analyses  of  combat  experience,  recency  of  com¬ 
oat  r1’ght  hours,  cc1 lege  level,  birth  date,  military  rank  and 

aw«»*cs  are  shewn  by  Figures  Id  through  20. 
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Histogram,  Interval  is  0.10 


EXPECTED  MISHAPS 
RANDOM  SAMPLE;  Or’  PILOTS 

The  aist  r  ib’itwn  -T  ••xpe-il’-T  mishaps,  r.alc<iiatt:r|  by  t  h  «.•  metho’l  Hcsc  rilieil  in  the  text. 


FIGURE  12. 
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DISCUSSION  AND  CONCLUSIONS 


The  regression  analysis  demonstrated  that  the  pilot  factors  described  on  the 
data  bank  have  some  influence  on  pilot  safety.  One  way  of  interpreting  the 
multiple  correlation  coefficient  is  to  consider  its  square  (R2)  as  the 
proportion  of  the  variance,  or  scatter,  in  the  pilot  safety  rankings  which  is 
explained  by  the  variables  in  the  regression  equation.  Thus,  thirty  four 
percent  of  the  variance  in  the  binomial  rankings  can  be  explained  by  the 
influence  of  such  variables  as  age,  flying  experience,  and  combat  experience. 
Sixty-six  percent  of  the  variance  remains  unexplained. 

There  are  several  ways  in  which  the  multiple  correlation  coefficient  obtained 
in  this  analysis  might  be  improved.  In  the  interests  of  testing  a  large 
amount  of  data  in  a  systematic  fashion,  the  data  was  used  pretty  much  as  it 
came  from  the  Navy.  In  several  cases  we  attempted  to  impose  a  linear  relation¬ 
ship  on  variables  which  had  aribtrary  numbers  assigned  to  their  different 
levels.  It  would  be  desirable  to  either  reorder  these  levels  within  the 
variable  or  to  treat  each  possible  level  as  a  binary  variable  indicating  that 
a  pilot  either  does  or  does  not  have  a  particular  characteristic.  Plots  of  the 
residuals  against  each  variable  were  inspected  for  evidence  of  higher  order 
dependencies  of  the  criterion  on  the  predictor  variables;  no  such  evidence 
was  found. 

Another  way  to  improve  the  prediction  equations  would  be  to  further  improve 
the  exposure  index.  The  fact  that  the  safety  rankings  showed  some  dependence 
on  intuitively  meaningful  variables  such  as  pilot  experience  is  evidence  that 
the  exposure  index  has  successfully  removed  some  of  the  effects  of  the  different 
types  of  flying  environment,  but  doubtlessly  some  of  the  unexplained  variance 
is  still  due  to  this  effect.  Another  factor  contributing  to  the  lack  of  a 
high  multiple  correlation  is  the  spotty,  incomplete  nature  of  the  data  base. 
Perhaps  an  effort  to  collect  fewer  types  of  information  in  a  more  complete 
manner  might  be  productive. 

figure  $  shows  the  final  regression  equation  to  predict  the  binomial  rankings. 
This  ecuation  includes  every  variable  with  at  least  a  ninety  percent  probability 
of  actually  contributing  to  the  "fit"  of  the  equation.  Figure  22  is  a  summary 
of  the  increase  in  the  multiple  correlation  coefficient  as  the  computer 
program  moved  through  the  various  steps  of  curve  fitting.  The  first  two  or 
three  variables  entered  account  for  most  of  the  predictive  power  of  the  equation. 

St  should  be  remembered  that  an  unknown  portion  of  the  correlation  in  this 
eauation  is  due  to  characteristics  of  the  sample  population  which  are  not  true 
characteristics  of  the  total  population.  If  the  equation  were  to  be  applied  to 
some  other  sample  population,  it  would  not  account  for  as  much  of  the  variance 
as  it  does  for  this  population.  It  is  difficult  to  estimate  how  much  of  this 
"shrinkage"  will  occur.  Future  investigations  might  well  apply  this  equation  to 
some  other  population  in  a  cross  validation  to  measure  the  actual  shrinkage  of 
the  predictive  power  in  at  least  one  case. 

Despite  these  criticisms,  the  multiple  correlation  found  for  the  binomial  ranking 
eauation  is  in  the  same  range  as  those  used  with  success  in  areas  such  as  employee 
selection.  This  equation  could  be  used  to  predict  the  binomial  ranking  scores  of 
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individual  pilots.  The  predicted  binomial  ranking  scores  reflect  the 
probability  that  a  pilot  will  have  mishaps  at  less  than  the  expected  rate. 
The  equation  could  be  applied  to  a  group  of  pilots  to  differentiate  between 
those  who  were  expected  to  perform  well  (high  scores!  and  those  who  were  not 
( 1  ow  scopes) . 


_ SUMMARY  TABLE  FROM  STEPWISE  REGRESS! Q1 

The  dependent  variable  is  Binomial  Ranking 


STEP 

VARIABLE 
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INCREASE 

1  VALUE 

NUMBER 

ENTERED 

R  Rj* 

SQUARED 

!_NR->nuA^RED 

TO  ENTER 

1 

Last  yr .  combat 

.52 

.27 

.2748 

428.7 

2 

Birth  year 

.55 

.30 

.0252 

40.7 

3 

Months  combat 

.  56 

.31 

.0128 

21 .1 

<• 

V  T  3 

.57 

.32 

.0085 

14.1 

K 

Level  college 

.57 

.32 

.0034 

5.6 

6 

Current  source 

.57 

.33 

.0024 

4.0 

• 

Total  f 1 ignt  hours 

.57 

.33 

OJ 

O 

O 

3.6 

< 

merest  award 

.58 

.33 

.0022 

3.7 

? 

Rank 

.58 

.33 

.0019 

3.2 

1 0 

VT4 

.58 

.34 

.0018 

3.1 

Figure  22 
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APPENDIX  A 


The  results  of  the  A4  aircraft  task  analysis  are  shown 
c*  ciQures  1  through  o.  The  figures  •illustrate  the  techninue 
used  and  t'ne  ’•elationsrip  of  work  load  to  accidents. 
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ABSTRACT 

A  state  of  fatigue,  induced  by  continuous  performance  of  tasks  over 
prolonged  working  time,  was  studied  in  a  laboratory  environment,  The 
tasks  were  selected  to  resemble  the  airborne  activities  necessary  to 
a  prolonged  ASW  mission. 

Two  loading  conditions  were  imposed  such  that  12  subjects  in  a  High 
Work  Load  (HWl)  group  worked  without  rest  for  approximately  18  hours 
and  12  different  subjects  in  a  Low  Work  Load  (LWL)  group  worked  for 
the  same  time  span,  but  w ere  permi tted  equal  time  on  and  off  work, 
such  that  their  workload  was  approxima tel y  one-half  that  of  the  HWL 
group.  Psychological,  physiological,  biochemical,  and  performance 
measures  were  taker,  for  each  individual  subject. 

Performance  decrement  was  obtained  for  the  HWL  reflecting  at  least 
moderate  fatigue.  A  specially  designed  Discrete  Tracking  Task  was 
the  most  sensitive  index  of  fatigue.  The  task  was  moderately  diffi¬ 
cult  with  almost  excessive  temporal  demands.  The  HWL  group  also  made 
larger  time  estimation  errors  late  in  the  session  than  the  LWL  group, 
,Nc  other  psychological,  biccnemical ,  or  physiological  variable  was 
correlated  with  performance  decrement  (Fatigue)  in  this  study. 
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INTRODUCTION 


fatigue  1 s  an  elusive  phenomenon.  Although  virtually  everyone  has  experienced 
it,  fatigue  is  extremely  difficult  to  define  to  everyone's  satisfaction,  and 
perhaps  even  more  difficult  to  measure.  Some  authorities  consider  the  term  of 
negative  value  and  prefer  not  to  use  it.  Nonetheless,  "FATIGUE!"  is  frequently 
thought  to  be  a  contributor  to  accidents  and  reduced  effectiveness  of  personnel. 

If  so,  understanding  fatigue  is  crucial  to  all  areas  of  endeavor,  and  particularly 
to  military  commanders,  who  may  find  it  desirable  to  schedule  personnel  for 
frequent  and/or  extended  missions. 

Idany  studies  have  been  performed  to  improve  our  understanding  of  fatigue.  Fatigue, 
for  most  of  tne  scientific  studies,  has  been  operational ly  defined  as  the 
degradation  in  performance  or  affective  state  resulting  from  previous  work. 

This  definition,  of  course,  could  include  studies  such  as  those  in  the  area  of 
vigilance,  in  which  boredom  may  play  a  prominent  role.  Indeed,  boredom  may  be 
on  one  end  of  an  activity  continuum  with  fatigue  at  the  opposite  end  with  normal 
daily  activities  along  the  mid  points.  However,  boredom  need  not  be  a  concomitant 
cf  work  or  even  extended  work.  Therefore,  it  is  considered  for  the  purposes  of 
tnis  present  discussion  as  a  confounding  variable  and  as  something  to  be  avoided 
in  fatigue  studies  insofar  as  possible  at  the  present  time.  Sleep  deprivation 
for  extended  periods  is  also  viewed  as  a  separate,  although  relevent,  variable 
and  is  purposely  ignored. 

Once  fatigue  is  well  understood,  it  will  be  possible  to  include  its  interaction 
with  the  variaoles  of  boredom,  environmental  stress,  anxiety,  abnormal  sleep 
deprivation,  diurnal  cycle  effects,  and  similar  stresses.  Thus,  the  present 
authors  would  accept  the  operational  definition  above  only  if  qualified  to  exclude 
boredom  and  other  confounding  stresses. 
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Most  of  the  fatigue  studies  reported  are  limited  by  the  relatively  limited  nature 
of  the  investigation,  In  fact.  Harris  .»n,j  hit  ccv.'orkers  { Id  56)  state  aft,-,  an 
extensive  review  of  the  literature  that  the  level  of  per*  forma  rice  as  a  function 
-fff-timc  h  a  s  -h'S"»gr~bWh~STud  i  ed  sy  s  tematTcaTlyuiule  r  long-term  or  short-term  con¬ 
ditions, 

A  number  of  laboratories  have  conducted  programs  which  included  several  studies. 
Especially  notable  were  tne  studies  conducted  by  the  R.A.F.  Institute  of  Aviation 
.Medicine  (Fraser,  1954;  Fraser,  1955;  Fraser.  1957;  Fraser  and  Samuel,  1956;  and 
Jackson,  1956;.  Among  the  conclusions  reached  by  these  workers  are:  (1)  a  sig¬ 
nificant  fatigue  effect  occurs  after  continuous  ten  hours  of  flight  in  a  piston 
engine  aircraft;  and  after  three  one-hour  sorties  in  jet  fighter  aircraft  or 
after  two  one-hour  sorties  at  night;  (2)  no  significant  fatigue  effect  is 
observable  after  continuous  sorties  for  3-4  hours  in  a  jet  bomber  during  the 
day;  (3)  considerable  fatigue  may  be  produced  by  flying  four  15-hour  sorties  at 
night  with  one  day's  rest  between  each  sortie  for  long-range  piston  engined  air¬ 
craft;  ana  (4)  marked  changes  in  performance  and  subjective  states  after  prolonged 
repeated  flights  can  occur  without  very  extensive  and,  as  yet,  detectable  physio¬ 
logical  cnanges  (Fraser,  1953).  The  specific  dependent  measure  which  seems  to  be 
the  most  reliable  index  of  fatigue  was  the  variance  of  the  subjects'  estimates  of 
tne  time  at  wnich  a  slowly  varying  signal  passed  through  the  center  of  a  projected 
display  (the  Z  function),  This  task  was  rarformed  before  and  after  flight. 
Although  one  report  in  the  series  (Jackson,  1956)  reported  a  decrement  in  holding 
a  constant  heading  after  forty  mi r.  ites,  this  may  be  a  function  of  boredom  rather 
than  fatigue. 


Efforts  to  measure  a  decrement  in  on-the-job  performance  have  been  generally  un¬ 
successful,  A  typical  study  (Shaw,  1954)  involved  Air  Traffic  Control  operators 
wnc  were  administered  a  test  before  and  after  regular  seven-hour  duty  watches 
when  traffic  was  heavy  enough  for  near  continuous  work.  Although  subjects  occasion¬ 
ally  complained  of  fatigue,  no  performance  impairment  was  obtained.  Grand jean 
(1969)  notes  that  such  s tudies  have  one  inherent  drawback;  work  has  a  stimulating 
effect  on  (a  person)  wnicn  may  overcome  existing  fatigue.  This  is  particularly 
true  if  the  worker  knows  he  is  being  observed  or  tested. 
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A  number  of  studies  have  attempted  to  correlate  physiological  or  ^vchological 
modifications  to  performance  decrement.  Physiological  concomitants  have  not 
beer  promising,  to  date,  brandjean  ( i y by )  states  that  there  are  no  practical0 
phys iolodical  or  psychological  objective  tests  of  fatigue  which  can  be  used  in 
industry  with  success. 

In  soite  of  the  lack  of  systematic  knowledge  after  considerable  effort,  there  is 
a  need  for  the  definition  of  a  quantitative  relationship  between  the  duration  of 
activity  and  the  degradational  effects  which  everyone  is  sure  exists,  and  objective 
methods  of  predicting  decrement.  Carpenter,  Creamer,  Gabriel,  and  Burrows  (1969) 
defined  a  set  of  goals  wnicn  appear  valid.  These  included: 

1.  Define  performance  parameters  influenced  by  continual  work  load. 

2.  Determine  the  work  load  which  results  in  the  degradation  in 
performance . 

3.  Define  physiological  and/or  behavioral  indices  of  the  output 
of  fatigue. 

4.  Develop  an  understanding  of  the  underlying  mechanisms  involved 
in  performance  degradation. 

3.  Discover  methods  of  mitigating  or  alleviating  degradation 
witnout  adverse  side  effects. 

In  tne  Carpenter,  et  al.,  study,  a  large  number  of  potential  behavioral  and 
physiological  indices  were  evaluated  in  eiqht-ncur  simulated  airborne  ASW  missions. 
Toey  also  attempted  to  evaluate  Bartlett's  hypothesis  of  increased  variaoility  and 
perceptual  oreak-uo  (Bartlett,  1953).  One  of  the  limitations  in  the  study,  however, 
was  tne  slight  degree  of  faoigue  brought  about  by  the  eight-hour  simulated  missions. 
!t  was  decided,  therefore,  to  conduct  an  experiment  which  would  attempt  to  increase 
the  fatigue  state  and  use  some  of  the  most  promising  indices  of  fatigue  used  in  the 
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Carpenter  study.  Psychological  dependent  variables  selected  were  near  and  far 
phoria,  time  estimation  and  size  constancy.  Physiological  variables  deemed 
showing  promise  for  application  in  the  operational  settinq  were  heart-rate, 
respiration  rate,  basal  skin  resistance,  and  core  temperature, 


Several  additional  promisi ng  measures  were  added.  Kalsbeek  (I96fi)  found  that 
aeart-rate  variability  ("sinus  arrythmia")  was  directly  related  to  the  work  load. 
This  variability  decreased  as  a  function  of  the  difficulty  of  the  task.  It  was 
hypothesized  that  heart-rate  variability  over  longer  time  periods  might  be 
related  to  fatigue. 

The  phospholipids  yielded  statistical  significance  as  a  function  of  load  in  the 
Carpenter  study.  Analysis  techniques  are  involved  and  it  was  decided  that 
operational  use  would  be  limited.  Urine  analysis  of  sugar,  protein  and  pH  was 
substituted  since  these  analyses  are  easily  performed. 

To  allow  for  ar,  extension  to  16  hours  or  more,  a  laboratory  study  was  designed. 
Tasks,  however,  were  selected  to  represent  those  typically  required  in  airborne 
ASW  missions . 

In  addition  to  determining  sensitive  indices  of  performance  decrement,  the  useful 
ness  of  tne  variables  in  predicting  specific  performance  decrement  was  to  be 
assessed.  This  latter  factor  greatly  influenced  the  experimental  design. 

Ir  summary,  the  objectives  o*  the  current  study  were  defined  as: 

1.  Validate  specific  results  of  the  Carpenter,  et  al .  (1969)  study. 

2.  Extend  the  fatigue  state  until  greater  performance  decrement 
was  obtained. 

3.  Define  the  correlation  of  performance  decrement  and  psycholo¬ 
gical  and  physiological  measures;  that  is,  determine  the  amount 
of  cnanqe  for  a  given  degree  of  performance  decrement. 

4.  Evaluate  heart-rate  variability  as  a  measure  of  fatigue. 
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Twenty-four  male  graduate  and  under-graduate  students  from  California  State 
College,  Long  Beach,  were  used  as  subjects  (5s).  Volunteers  were  solicited 
for  paid  participation.  From  those  volunteering,  Ss  were  selected  on  the  basis 
of  good  health  (see  Appendix  for  screening  questionnaire),  prior  military  ser¬ 
vice  (not  Possible  in  all  cases),  and  willingness  to  complete  the  experiment  or 
go  without  oay.  The  age  of  those  selected  ranged  from  19  -  29  and  averaged  22 
years . 


EXPERIMENTAL  design 

A  Lindquist  'Type  I"  experimental  design  was  used  (Lindquist,  1963).  Twenty- 
Four  Ss  were  randomly  assigned  to  or*  of  two  groups,  twelve  to  the  "Low"  Work 
Load  group  (LWL),  and  twelve  to  the  "High"  Work  Load  (HWL)  group.  Work  period, 
the  secord  independent  variable,  was  a  within  subjects'  comparison.  Subjects 
participated  for  one  day. 

Two  Ss  were  tested  each  experimental  day.  Ss  for  a  given  day  were  assigned  to 
the  same  workload  condition  to  reduce  motivational  confoundings .  Other  effects 
for  days  were  controlled  by  counterbalancing . 


EQUIPMENT  AMD  TASK  DESCRIPTION 


Experimental  Area 

The  experimental  room  is  represented  in  Figure  1  .  The  room  was  partitioned  by 
portable  screens  into  three  general  areas:  A  work  area,  which  included  three 
work  tables  on  which  the  various  tasks  were  performed;  a  sleeping  area,  which 
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LABORATORY  OPERATING  AREA  FLOOR  PLAN 


conta’.-'e;:  two  Army  cots  anc  oedding;  jmi  an  experimenters  area  wnich  rotiln  i :u.J 
task  programming,  data  recording,  and  processing  equipment.  Tne  arrangement  wo- 
designed  to  minimize  visual  intrusions  and  distractions  for  the  Ss.  Lavatories 
were  located  approximately  50  feet  away. 

Temperature  within  the  test  chamber  was  maintained  between  74°  and  76° F .  except  * 
during  sleep  periods,  when  it  was  raised  to  78°  to  ensure  comfort  with  a  single 
blanket.  Illumination  was  from  diffused  fluorscent  lighting.  Incident  light  on 
the  work  surfaces  was  80  foot-candles.  Noise  levels  in  the  laboratory  ranged 
from  64  -  68  dBC.  Much  of  the  noise  resulted  from  the  air-conditioning  and  was 
greatest  for  the  lower  octave  bands. 


Physiological  Recordings 

Physiological  data  recorded  included  heart-rate,  basal  skin  resistance,  thoracic 
and  abdominal  .respiration,  oral  temperature,  and  urinalysis  of  pH,  protein  and 
glucose.  For  the  first  three  (continuous)  measures  an  automatic  timer  controlled 
the  circuitry  so  that  for  each  successive  six  minutes,  three  minutes  of  data  were 
collected  for  each  in  alternate  three-minute  intervals. 

To  obtain  heart-rate  data,  Beckman  biopotential  skin  electrodes  were  applied  to 
the  shaven  chest  after  cleansing  the  area  with  isopropyl  alcohol.  A  detergent 
electrode  paste  was  used.  The  heart-rate  signals  were  amplified  by  a  differential 
amplifier  with  a  gain  of  15,000  and  a  band  pass  of  5  -  45  Hz  to  enhance  the 
signal -to-noise  ratio. 

To  obtain  values  of  variability  in  heart-rate,  the  basic  data  recorded  wore  the 
times  required  for  10  successive  heart-beats.  This  was  accomplished  by  detecting 
the  P-wave  of  each  successive  beat  by  a  Schmitt  trigger.  An  integrated  "divide 
by  10"  circuit  was  used  to  gate  the  initial  pulse  and  reset  after  the  10th  beat. 

An  oscillator  counted  the  time  required  for  the  ten  beats  and  was  recorded  by  a 
digital  printer. 
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Jata  were  acquired  for  both  thoracic  and  abdominal  respiration.  Specially 
developed  strain  gauges,  consisting  of  a  thin  brass  seam,  encapsulated  in  R.T.V. 
44-LU:uirw^4ubbets-cwarui-»W(»4-. — wei**?-’ connects  d“tO'*orre=rti3'tf^rtiich“ 
elastic  bands  sewn  to  a  fabric  belt.  Velcro  was  used  on  the  ends  of  the  hell 
to  allow  adjustments,  One  belt  was  worn  around  the  chest,  while  another  one 
enclosed  the  waist,  The  outputs  of  the  transducers  were  fed  to  self-balancing 
bridges  and  recorded  on  a  Beckman  strip  chart  recorder.  Three  minutes  of  data 
were  acquired  for  each  subject  in  a  given  six-minute  interval. 

Sasal  skin  resistance  was  sensed  with  electrodes  placed  approximately  8  -  10 
inches  above  the  ankle  on  the  right  tibia.  A  10  microampere  100  Hz  activating 
current  was  passed  Detween  this  electrode  and  the  common  ground  electrode  located 
approximately  2  inches  above  the  right  ankle.  One  value  was  obtained  for  each 
ten  oeats  of  tne  heart,  processed  through  a  digital  voltmeter  and  printed  out. 

A  belt  pack,  consisting  of  batteries,  etc.,  was  worn  at  the  waist.  Leads  were 
bundled  into  a  45-inch  cable  for  flexibility  in  movement  from  one  task  to  another 
around  the  laboratory.  For  sleep,  the  battery  pack  was  mounted  on  the  wall 
adjacent  to  the  head  of  the  cots.  All  digitized  heart-rate  data  were  also 
recorded  in  analog  form  on  the  strip  chart  recorder  for  redundancy. 

Oral  temperature  was  obtained  with  a  standard  clinical  oral  thermometer. 

urinalysis  to  determine  pH, 'glucose,  and  protein  concentrations  was  performed  by 
tne  Comcistix  method. 

Slock  and/or  wiring  diagrams  of  this  physiological  data  system(s)  are  provided 
in  the  Appendix. 
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Performance  Tasks 


Performance  tasks  were  divided  into  two  types  -  loading  tasks  and  tost  tasks. 

'■'t-  loading  problems  were  designed  to  qenerate  fatigue,  while  the  test  tasks  _ 

were  designed  to  measure  fatigue  unconfounded  by  differential  practice.  Tasks 
were  selected  to  relate  to  typical  ASW  operator  requirements. 

The  continuous  tracking  task  was  the  primary  electro-mechanical  device  required 
for  tne  loading  portion  of  the  experiment,  The  block  diagram  for  this  system  is 
■provided  in  Figure  2.  The  5.  was  required  to  center  a  "blip"  on  a  five-inch 
catnode  ray  oscilloscope  by  manipulating  a  control  stick.  The  "blip"  was  per- 
turbated,  in  the  horizontal  axis  only,  by  a  random  forcing  function  from  a  random 
noise  generator.  The  control  stick  was  spring-loaded  to  center,  presented  little 
resistance  to  displacement,  and  was  free  to  move  through  360°. 

A  feedoack  loop  in  tne  circuit  increased  the  difficulty  of  the  task  as  a  function 
of  the  operator's  (Ss)  performance.  The  output  of  the  averager  was  continuously 
fed  buck  into  the  analog  multiplier  to  vary  the  display  sensitivity.  The  dis¬ 
play  sensitivity  varied  from  2cm/volt  error  in  the  easiest  state  to  20cm/volt 
e^ror  in  the  most  difficult  state.  The  voltage  output  of  the  averager  was  also 
fed  through  an  amplifier  and  integrated  over  one-ninute  intervals  of  time.  This 
value  was  displayed  on  an  electronic  counter  as  the  performance  measure.  Thus, 
the  dependent  variable  was  the  forcing  function  displayed  as  a  unit  of  voltage  In 
arbitrary  units.  Poor  performance  and/or  low  effort  were  represented  by  a  low 
forcing  function  voltage.  Difficulty  level  varied  from  105  to  400  volts. 

Additional  loading  tasks  consisted  of  cognition  tasks,  such  as  solving  navigational, 
plotting  and  fuel  consumption  problems.  A  total  of  82  such  problems  were  prepared. 
Twenty-seven  of  these  problems  were  estimated-time-of-arri val  problems,  while 
thirty-four  were  plotting  problems,  and  twenty-one  were  fuel  consumption  problems. 
Forms  were  prepared  which  provided  the  necessary  data  and  Ss  were  to  carry  out  the 
mathematical  solution.  The  Appendix  provides  an  example  of  each  type  of  problem, 
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oscu  La^copt: 


Tne  test  tasks  included  Discrete  Tracking,  Track-to-Intcrcept  (TTl),  and  psycho- 


y  :>  i  ca  i 


stancy.  The  Discrete  Tracking  (DT)  task  consisted  of  six  vertical  and  six  hori¬ 
zontal  lights  in  an  "  j-  "  arrangement  mounted  on  a  vertical  panel.  Push-button 
switches  on  a  keyboard  were  similarly  arranged.  If  a  light  was  on,  the  depression 
of  the  corresponding  push-button  would  extinguish  it.  Two  lights  were  illuminated 
at  a  time,  one  in  the  vertical  row  and  one  in  the  horizontal  row,  in  a  random 
order.  Tne  S's  task  was  to  turn  out  the  lights  as  soon  as  he  could,  using  the 
left  hand  for  the  vertical  column  of  switches  and  the  right  hand  for  the  horizontal 
row  of  switches.  The  presentation  rate  of  the  lights  could  be  programmed  for 
a6,  30,  or  53  light  pairs  per  minute  and  was  independent  of  S's  response.  A  switch 
on  the  experimenter's  console  was  used  to  establish  the  desired  rate.  The  cir- 
cuit  provided  for  the  accumulation  of  the  total  time  the  operator  was  on  target 
(i.e.,  lights  turned  out)  for  each  one-minute  interval. 


A  vigilance  task  was  performed  during  the  discrete  tracking  test.  This  task 
involved  detecting  tne  illumination  of  a  small  neon  light  located  approximately 
at  seated,  eye-level  and  30  inches  to  the  left  of  the  operator  at  a  45°-angle. 

A  push-button  switch  mounted  on  the  table-top  to  the  S's  left  was  used  to  indicate 
signal  detection  and  turn  off  the  light.  The  light  was  presented  at  random,  with 
the  following  constraints:  a  total  of  5  presentations  in  each  block  of  ten  one- 
minute  discrete  tracking  periods,  no  more  than  three  presentations  would  occur  in 
one  five-minute  period,  and  no  lights  presented  before  15  seconds  or  after  45 
seconds  of  a  given  one-minute  interval  had  elapsed.  A  digital  timer  connected  in 
series  with  the  light  and  switches  allowed  the  experimenter  to  record  detection 
times.  The  light  was  turned  off  by  £_  if  not  detected  within  one  minute. 

The  S  used  a  standard  Bausch  and  Lomb  Orthoratt.*  to  measure  near  and  far  phoria. 

Limb  steadiness  and  time  perception,  although  two  separate  tasks,  were  administered 
simultaneously.  Limb  steadiness  involved  inserting  a  one-eighth-inch  metal  stylus 
into  a  one-quarter-i rich  circular  aperture  with  the  preferred  hand.  Contact  of  the 
stylus  with  the  side  of  the  aperture  completed  a  circuit  which  activated  a  digital 
timer.  The  recorded  measure  was  error.  The  S  initiated  the  trial  when  he  was 
"set"  and  terminated  when  he  judged  his  time  estimation  interval  had  elapsed.  Time 
intervals  to  be  estimated  were  between  34  and  73  seconds.  One  value  was  randomly 
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selected  for  each  trial.  The  S  turned  a  switch  starting  the  time  estimation 
tinier  and  actuating  the  limb  steadiness  circuit.  When  he  judged  that  the  given 

W  1  u  „  ,4  »1  „  ~  U  „  „_J  4.U  .  J  *.  ^  A  -  . .  .  „  .4 _4  u  J  -  *.  J ,.  All  r- 
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counted  bacKward  from  100  by  3's  while  performing  these  tasks. 


The  test  for  size  constancy  required  S  to  estimate  the  actual  length  of  a  pro¬ 
jected  vertical  line  from  a  distance  of  9  feet,  Eighty-one  photographic  slides 
of  narrow  black  rectangles,  ranging  in  length  from  5  to  50  cm  in  1  cm  units,  were 
projected  on  a  white  paper  screen.  Ten  slides  were  displayed  during  each  test 
unit,  by  means  of  a  Carousel  slide  projector.  The  order  of  the  slides  was 
randomly  determined  and  repetition  of  lengths  was  also  random.  The  S  viewed  the 
projected  image  and  using  a  reference  scale  provided  at  his  station,  estimated 
the  length  of  the  image  in  centimeters.  Figure  3  provides  representative  views 
of  the  apparatus.  The  Appendix  provides  the  specific  task  instructions. 

PROCEDURE 


Preliminary  Study 

Four  Ss  participated  in  a  preliminary  study  consisting  of  two  test  days,  one  day 
for  each  work  load  condition.  This  "pilot"  study  served  as  a  dress  rehearsal  for 
the  experimenters,  established  training  requirements  and  assisted  in  refining 
the  experimental  schedule.  These  data  were  not  included  in  the  results. 

Daily  Schedule 

On  a  test  day,  two  Ss  reported  for  duty  at  0345  hours.  Both  $s  were  assigned  to 
the  same  work  load  group.  After  a  preliminary  briefing  and  fami  larlzation  with 
the  laboratory  area,  Ss  filled  out  a  questionnaire  and  had  their  oral  temperature 
taken.  Next,  a  brief  fami larization  with  the  task  equipment  was  provided.  Dur¬ 
ing  the  briefing,  Ss  were  informed  of  the  monetary  bonus  of  $20.00,  $12.50,  and  $5.00, 
for  the  best  3  performances  in  their  group.  Although  Ss  knew  about  the  general 
nature  of  the  study,  they  were  not  informed  of  the  existence  of  two  qroups.  The 
physiological  recording  preparation  consisted  of  the  attachment  of  sensors  and 
the  physiological  harness  to  the  Ss.  The  foregoing  activity  required  about  one- 
naif  hour. 
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Vir»n  of  1*0  laboratory 


Specific,  in-depth  training  on  ail  tasks  mas  given  next.  An  average  of  about 
two  hours  was  devoted  to  this  training,  althouuh  this  time  varied  somewhat,  due 
to  individual  differences.  Since  most  of  the  tasks  were  largely  self-administered, 
+t-~yaS"THiport3ftt  tO'brxe'rtatr  fc~  Lndrd'Jghl y^am 1 1  la r  wi'ih  the  ' 

requi rements .  Training  consisted  of  demonstrating  the  task,  instructing  in  the 
specific  components,  and  providing  practice  with  feedback. 

Two  complete  test  cycles  were  performed  as  covariant  measures  before  the  load-test 
cycles  were  started.  Since  only  one  set  of  equipment  was  available,  Ss  were 
sequenced  through  tasks  in  the  same  order  hut  offset  by  one  task,  The  only 
difference  between  covariant  tests  and  later  tests  was  the  presentation  rate  on 
the  discrete  tracking  task,  which  was  increased  from  26  to  53  light  pairs  per 
mi nute. 

Subjects  started  the  loading  tasks  immediately  after  completing  the  second  co¬ 
variant  session. 

me  general  plan  of  the  experiment  was  to  load  the  Ss  for  a  series  of  consecutive 
time  periods  with  test  periods  in  between.  The  loading  period  lasted  approximately 
70  minutes,  while  the  test  period  required  20  minutes  (see  Figure  4  ).  One  S 

worked  on  continuous  tracking  while  the  other  worked  on  cognitive  problems. 

After  35  minutes,  they  changed  place  for  35  more  minutes. 

A  similar  procedure  was  used  during  the  test  tasks.  One  S_ would  work  for  the 
assigned  length  of  time  (approximately  5  minutes)  at  one  task,  '.g.,  discrete 
tracking,  while  the  other  would  work  at  the  psychological  task,,  etc.  in  the  test 
pnases.  However,  the  Ss  completed  each  test  task  just  once,  before  going  back 
to  the  loading  tasks  to  start  the  whole  cycle  over. 

The  Ss  in  the  HWL  group  received  no  breaks,  except  for  1  or  2  trips  to  the  lava¬ 
tory,  until  just  before  the  sleep  period.  Eating  (sandwiches,  crackers  and 
peanut  butter,  fruit,  cookies,  and  candy  bars)  took  place  while  the  S  was  working 
on  work  load  tasks.  Food  was  provided  at  1330  and  1900  hours. 


The  LWL  group  followed  the  same  schedule  as  the  HWL  group,  with  but  one  change, 
After  each  5  min.  in  the  work  load  phases,  a  5-min.  rest  was  provided.  There 
fore,  the  LWL  Ss  were  loaded  only  one-half  as  much  as  the  HWL  Ss,  excluding 

-test-peri  odrr - - - - - - - - - - - * — - — - - - - - — 

The  completion  of  the  ninth  cycle  terminated  the  work  portion  of  the  test  day. 

The  phy-, iol ogicul  harness  was  uncoupled  and  the  Ss  taken  to  the  lavatory,  where 
a  urine  sample  was  obtained  and  tested.  When  the  Ss  returned  to  the  laboratory, 
they  completed  the  self-inventory  which  was  designed  to  determine  their  subjective 
state  of  fatigue.  The  physiological  harness  was  then  re-coupled  and  the  Ss 
retired  to  the  sleeping  area.  Laboratory  lighting  was  extinguished  during  the 
sleep  phases . 

The  Ss  were  awakened  at  0730  hours,  after  which  they  completed  one  additional  test 
period  identical  to  those  of  the  preceeding  day.  The  Ss  were  then  released  and 
the  laboratory  secured. 

Three  experimenters  (El  worked  8-hour  shifts.  Neither  Ss  nor  £5  were  allowed  out 
of  tne  laboratory  except  for  the  required  lavatory  visits. 


RESULTS 


The  results  are  presented  under  three  separate  headings.  First,  evidence  Is  pro¬ 
vided  to  determine  whether  fatigue  was  produced  In  the  HWL.  Potential  correlates 
of  fatigue  are  then  considered  and,  finally,  data  testing  Bartlett's  Hypothesis 
are  presented. 

During  the  long,  continuous  experimental  sessions,  some  data  were  lost.  Equip¬ 
ment  failure  was  the  primary  cause  of  lost  data,  but  In  two  cases  the  Ss  failed 
to  understand  the  instructions.  In  most  cases  where  data  were  lost,  data  from 
a  comparable  S  in  the  other  group  were  discarded,  so  that  an»lyses  were  balanced 
for  order  effects.  A  very  few  physiological  values  were  estimated  by  using 
adjacent  values.  In  these  cases,  the  Degrees -of- Freedom  (df)  in  the  error  term 
was  corrected. 

The  experimental  design  Is  shown  schematically  in  Figure  5.  Subjects  are  nested 
within  workload,  but  are  crossed  with  test  periods  (or  loading  periods  or  time). 
For  Discrete  Tracking,  two  additional  variables  were  added  (Minutes  within  test 
periods  and  horizontal  versus  vertical  Tlme-on-Target  (TOT)).  The  basic  design 
remains  unchanged,  however,  with  these  added  variables  being  crossed  with  all 
other  variables. 

The  .05  level  was  used  for  all  significance  tests.  Any  £  ratio  followed  by  an 
asterisk  is  significant  at  p^.OS. 

OPERATIONAL  DEMONSTRATION  OF  FATIGUE 

It  was  important  for  the  purposes  of  this  experiment  that  there  was  evidence  of 
fatigue.  Furtner,  magnitude  of  fatigue  at  different  times  in  the  experimental 
session  would  allow  for  correlation  of  potential  fatigue  indices  with  performance 
decrement  attributable  to  fatigue. 


Ill 


Discrete  tracking  was  the  primary  task  to  demonstrate  fatigue,  but  continuous 
tracking,  the  cognitive  track  to  intercept  problems,  and  the  Self-Report  Inven¬ 
ts  ry ,  were  also  uscu.  o  1 1  ice  a  goo u  ueat  or  learning  was  anticipated,  the 
divergence  of  the  LWl  and  HWl  functions  was  considered  to  be  the  best  operational 
demons tra ti on  of  fatigue.  Further  evidence  of  fatigue  could  be  inferred  from  a 
decrement  for  the  HWL  late  in  the  experimental  session,  should  a  decrement  occur. 


Discrete  Tracking 

The  performance  measure  for  Discrete  Tracking  (DT)  was  a  Time-on-Target  (TOT)  in 
seconds  cumulated  over  one-mi nute-periods  within  the  Horizontal  and  Vertical  dimen¬ 
sions.  The  pace  was  rapid  producing  the  relatively  low  mean  TOT  scores  shown  In 
Figure  6. 

The  most  salient  feature  of  these  data  is  the  significant  WL  X  Test  Period  inter¬ 
action  (see  AHOVA  Summary  in  Table  I).  The  groups  begin  to  diverge  on  the  fifth 
test  period  and  the  difference  is  obvious  on  the  seventh  test  period. 

3oth  groups  show  approximately  equal  performance  decrement  on  "the  morning  after", 
Test  10.  The  relative  difference  between  the  croups  is  virtually  unchanged  by 
the  short  sleep  period  (4-6  hours). 


Tract-to-Intercept  Cognitive  Problems 

The  cognitive  test  problems,  track-to-intercept,  were  scored  in  three  parts,  each 
having  a  value  of  one.  Therefore,  a  (S  could  obtain  0,  1,  2,  or  3  correct  for  a 
particular  test  period.  The  mean  number  correct  responses  are  shown  in  Figure  7. 
The  ANOVA  F  values  are  also  shown. 

In  order  to  reduce  the  error  variance  and  also  to  statistically  equate  the  two  WL 
groups,  an  analysis  of  covariance  was  run.  The  two  preliminary  covariant  trials 
were  combined  as  the  covariant  and  test  periods  6,  7,  8,  9,  and  10  were  sumned  for 
the  variate.  The  WL,  F  (1,21)  =  .849  was  not  significant. 
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TABLE  1 


Analysis  of  Variance  Summary  Table  for'  Discrete 


Source  df  Mean  Square 


1 

I  Work  Load 

1 

657.46 

k  Test  Period 

9 

540.03 

L  Minutes  in  Test  Period 

4 

33.98 

M  Horl2ontal/Vert1cal 

1 

509.65 

J(I)  Subject  (nested  within) 

14 

1365.52 

IK 

9 

267.78 

1L 

4 

7.77 

<L 

36 

6.62 

1M 

1 

13.79 

iC 

9 

12.41 

LM 

4 

6.25 

J«  I) 

126 

67.46 

JL(l) 

56 

11.67 

JM(  I ) 

14 

694.84 

IKL 

36 

5.50 

I  KM 

9 

18.79 

ILM 

4 

1.41 

KLM 

36 

2.92 

JKL(I) 

504 

5.88 

JKM(I) 

126 

20.83 

JLM(I) 

56 

4.34 

I  KLM 

36 

3.27 

JKLM(I) 

504 

3.27 

*  Significant  at  .05  level 
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19 


Tracking 


F. 


0.48 
2.02 
2.91  * 
0.73 

3.97  * 
0.67 
1.13 
0.02 
0.62  * 
4.43  * 


0.94 

0.90 

0.07 

0.90 


1 .00 


Continuous  Tracking 

The  adaptive  Continuous  Tracking  (CT)  task  yielded  a  response  measure  which 
ref  Tec  ted  the  !eve  !of~ difficulty. 


Thai'S  f  OT"u  i  rig  i 


arbitrary  voltage  units,  such  that  large  Values  represent  hard  work  and  good 
cerformanee  combined,  With  the  control  stationary,  a  score  of  105  resulted  in 
a  one-minute-period.  The  best  score  obtained  was  just  above  400. 


Tne  means  are  snown  In  Figure  8  .  It  is  obvious  that  the  groups  differ  for  all 
loading  periods.  The  Wl  X  Loading  Period  interaction  was  not  significant  to 
support  the  trends  for  trie  High  and  Low  WL  functions  to  diverge  across  loading 
periods . 


The  main  effect  between  groups  dissipated  when  an  Analysis  of  Covariance  was 
used.  Loading  period  one  was  used  as  the  covariant  and  periods  6,  7,  8,  and  9 
were  combined  as  the  variate.  The  £  (1,21)  of  1.704  was  not  reliable.  The  LWL 
adjusted  mean  was  372  and  the  HWL  adjusted  mean  was  348  with  the  difference  still 
favoring  the  LWL  group. 


Using  the  DT  function  for  an  estimate  of  the  onset  of  fatigue,  one  further  analysis 
was  performed  to  see  if  the  WL  functions  diverged  significantly  after  the  fifth 
loading  perid.  The  £  (3,66)  =  4.651  was  significant  for  the  WL  by  Loading  Period 
interaction.  The  HWL  decrement  was  significant  for  these  four  loading  periods. 


During  the  last  four  load  cycles  performance  did  deteriorate  with  the  loading 
cycle.  Means  were  computed  for  each  of  six  five-minute  work  periods  for  the  cycle. 
From  the  first  to  sixth,  the  means  were:  354,  341,  336,  328,  326,  and  327. 


Self-Report  Inventory 

The  Ss  self-report  responses  are  summarized  in  Table  II.  Means  for  High  and  Low 
WL's  and  Estimated  Standard  Error  of  Means  are  given. 
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TABLE  1 1 

SELF-REPORT  INVENTORY  WITH  ITEM  MEANS  AND  STANDARD  ERRORS 
OF  MEANS  FOR  HIGH  AND  LOW  WORK  LOAD  CONDITIONS 


1  1  Strongly  Disagree 

2  =  Disagree 

3  *  Agree 

4  »  Strongly  Agree 


ITEM 


3.  Wnen  it  was  time  to  eat  lunch  today,  I 
seemed  to  have  lost  my  appetite. 


.  Time  seemed  to  pass  very  slowly. 


5.  The  break  for  lunch  seemed  to  refresh  me. 


6.  Any  errors  I  made  today  were  the  fault  of 
other  people. 


8.  I  did  not  maintain  the  same  level  of 
performance  throughout  the  day. 


.  It  was  difficult  to  wake  up  this  morning. 


11.  The  tasks  seemed  to  be  reiated  to  flight. 


I  felt  as  though  I  were  pushing  myself  to 
complete  the  tasks. 


I  had  difficulty  falling  asleep  last 


17.  I  felt  that  if  I  could  have  just  stopped 
and  closed  my  eyes  for  ten  minutes,  I 
could  have  continued  working  indefinitely 


18.  Now  that  the  day  is  over,  I  feel  tired.  | 


19.  I  could  not  have  continued  working  for 
several  hours  more. 


20.  I  did  not  get  a  good  night's  sleep 
last  nTgFTt. 


Table  11  (2  of  3) 


1  *  Never 

2  *  Sometimes 

3  3  Often 

4  -  Constantly 


Table  II  (3  of  3) 


1  =  Never 

2  *  Sometimes 

3  -  Often 

4  =  Constantly 


r 


I 

item  j 

.  .  1 

LOW 

WL 

HIGH 

WL 

MEAN 

S.E. 

MEAN 

S.E. 

40. 

...  I  began  to  sweat. 

1.17 

.11 

.92 

.29 

41  . 

...  I  became  sleepy . 

2.00 

.12 

1.75 

.75 

43. 

.  .  .the  noise  became  annoying. 

1  .75 

.13 

1.33 

.65 

44. 

.  .  .my  leg  fell  asleep. 

1 .08 

.08 

.92 

.29 

45. 

.  .  .1  rubbed  my  eyes. 

2.00 

.21 

1.83 

.83 

46. 

.  .  .the  instrument  harness  felt 
uncomfortable. 

'  2.25 

.18 

1.67 

1.07 

47. 

.  .  .it  became  difficult  to  make  decisions 
quickly. 

1.83 

.11 

2.08 

1.00 

48. 

- -  i. 

.  .  .a  pain  developed  in  my  neck  and 
shoulders. 

1 .58 

.26 

2.08  1 

1 .00 

49. 

.  .  .1  yawned  or  stretched. 

2.42 

.15 

2.67 

1 .07 

50. 

.  .  .the  ear  phones  became  uncomfortable. 

1.58 

.23 

2.92 

1.31 

2 


Several  items  offer  evidence  of  the  £s  perception  of  fatigue  symptoms.  The 
HWL  groups  felt  they  did  not  maintain  the  same  level  of  performance  over  trials, 
felt  more  tired  at  the  end  of  the  day,  felt  less  motivated  to  maintain  their 
-pe-rformafiee-a't-a  high  level  ;  fe-Vt-  thst  Jtheir  respcnses  vicre  mcre  slugg1'Shy  and  — 
felt  they  had  more  difficulty  making  quick  decisions  than  the  LWL  group. 


These  and  many  other  items  indicate  that  the  HWL  group  verbalized  more  fatigue 
symptoms  than  the  LWL  group. 


CORRELATES  OF  FATIGUE 


Given  performance  decrement  as  a  function  of 
gical  and  psychological  behaviors  may  change 
Several  variables  were  measured  to  determine 
ance  decrement. 


continuous  work,  certain  physlolo- 
systematically  with  the  decrement, 
their  relationship  with  the  perform- 


Phori  a 

Near  vertical,  near  horizontal,  far  vertical,  and  far  horizontal  phoria  were 
measured  each  test  period.  Zero  phoria  on  the  vertical  measures  was  indicated  by 
a  response  of  five,  the  center  value  of  a  nine-point  scale.  Zero  phoria  for 
horizontal  judgments  was  a  response  of  8. 

One  measure  was  obtained  in  each  of  the  four  phoria  tests  in  each  test  period. 

These  responses  were  then  transformed  to  deviations  from  5  (or  8  for  the  horizontal). 
The  algebraic  means  (use  sign)  of  these  deviation  scores  are  shown  in  Figures  9 
through  12.  The  data  offer  no  suggestion  of  progressive  changes  in  pho**ia  across 
Test  Periods. 
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lining  the  covariant  measures,  :he  phoria  scores  were  also  analyzed  with  an 
analysis  of  covariance.  Test  scores  6,  7,  8,  9,  and  10  were  summed  and  used 
as  the  variate.  The  WL  F  ratios  were  .834,  .127,  .976,  and  .939  for  Near  Vertical, 
— ,  vp*  r-~H  o  Hz  o  n  ta  T-, — f  a  r-Ve  rt  4  c  stT-; — artdf  ar-  Horizon  ■tat;  respcctiVcTyy  wTthr  1  and'?!  df, 
No  significant  effects  cere  obtained. 

Since  tendencies  toward  positive  and  negative  phoria  for  later  test  periods 
might  balance  one  another,  the  absolute  means  (disregard  sign)  were  also 
analyzed.  These  means  are  shown  in  Figures  13,  14,  15,  and  16.  There  were  no 
significant  effects  and  no  meaningful  trends. 

Again,  an  analysis  of  covariance  was  used  to  reduce  the  error  variance  and 
statistically  equate  the  WL  groups.  The  WL  F  ratios  were  1.921,  .066,  .076,  and 
2.163  for  Near  Vertical,  Near  Horizontal,  Far  Vertical,  and  Far  Horizontal, 
respectively,  all  with  1  and  21  df.  None  of  these  values  are  significant. 


Time  Estimation 

During  each  test  period,  the  Ss  made  one  time  estimate.  Their  rescv  .  es  were 
recorded  to  the  nearest  .1  second  and  these  values  were  transforms j  to  absolute 
deviations  from  times  to  be  estimated.  The  mean  absolute  error  scores  as  a 
function  of  WL  and  test  period  are  shown  in  Figure  17.  The  HWL  groups  made 
slightly  greater  errors  and  this  tendency  increased  in  the  later  work  periods 
out  tnese  differences  were  not  reliable.  The  discrete  tracking  data  are  also 
presented  for  a  direct  comparison  of  the  two  variables.  The  point  of  divergence 
for  both  functions  occurs  at  approximately  the  fifth  loading  period. 

An  analysis  of  covariance  using  two  preliminary  trials  as  the  covariant  and  com¬ 
bining  Test  Periods  6,  7,  8,  9,  and  10  as  the  variate,  was  significant,  F  (1,21)  = 
6.153.  This  analysis  statistically  equated  the  two  groups  on  the  basis  of  pre¬ 
liminary  trials  so  that  the  significant  £_  represents  decrement  for  the  HWL  group 
in  Test  Periods  6,  7,  8,  9,  and  10.  The  L..l  and  HWL  adjusted  means  were  5,78  and 
11.53  seconds,  respectively. 
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aperature  during  a  trial.  A  low  score,  therefore,  reflects  good  performance . 

The  means  ara.  shown  in  Figure  IB  as  a  function  WL  and  Test  Period.  The  mean 
differences  are  not  reliable.  An  analysis  of  Covariance  on  Trials  6,  7,  8,  9, 
and  10  combined  failed  to  reach  significance,  £.(1  ,21  )  =  .099.  Figure  18  pro¬ 
vides  a  graphical  representation  of  this  data. 

Size  Estimation 

The  unit  of  measure  for  Size  Estimation  was  absolute  centimeters  error.  Ten  judg¬ 
ments  were  made  each  test  period.  The  means  and  standard  deviations  of  the  abso¬ 
lute  error  were  computed  and  used  as  raw  scores  for  the  analyses. 

Figure  19  shows  the  average  error  when  the  means  of  test  periods  were  used.  Sig¬ 
nificantly  larger  errors  were  made  in  the  later  work  periods  but  they  were  not 
associated  with  WL. 

The  means  of  the  Size  Estimation  standard  deviation  score  are  shown  in  Figure  20. 
There  were  no  meaningful  trends  in  these  means. 

Vigi lance 

Oaring  the  Discrete  Tracking  task  a  light  was  presented  on  some  trials.  When  S 
detected  the  light  in  his  peripheral  vision  he  was  to  extinguish  it  with  a  push¬ 
button  switch.  The  response  measure  was  RT  in  seconds  and  median  RTs  were  com¬ 
puted  for  blocks  of  test  trials.  None  of  the  mean  differences  shown  in  Figure  21 
are  significant. 
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Temperature 


Oral  temperature  was  taken  four  times:  9:00  A.M.,  4:00  P.M.,  3:00  A.M.,  prior 
Asleep,  and  7:30  A.M. ,  just  after  waking'  the  dl'urna'V  cycuTIwa V c  on  f frme <T 
(See  Figure  22),  but  no  other  means  were  reliably  different.  An  Analysis  of 
Covariance,  using  the  9:00  A.M.  values  as  the  covariant  and  the  7:30  A.M.  values 
as  the  variate  failed  to  show  reliable  differences,  F ( 1 , 31 ) =  0.  An  analysis 
of  covariance  for  the  3:00  A.M.  values  indicated  that  the  HWL  had  significantly 
lower  temperatures,  F_  (1,21)  s  4,089.  The  adjusted  means  were  97.7  and  97.1  for 
the  LWL  and  HWL  groups,  respectively. 


Urinary  Tests 

Specimens  of  urine  were  taken  after  the  final  test  cycle  and  after  a  short  sleep 
period.  Three  separate  tests  were  run,  using  the  combistix  technique  so  that 
specimens  were  placed  in  categories. 

Table  HI  gives  a  frequency  count  for  the  protein  concentration,  glucose,  and 
pH  tests  as  a  function  of  WL  and  time  of  measurement.  The  results  are  virtually 
identical  for  High  and  Low  WL  with  almost  no  change  from  the  first  to  the  second 
measure. 


Heart-Rate 

Heart-rate  was  recorded  alternately  for  the  two  Ss  each  session  in  three-minute 
blocks.  Time  elapsed  during  10  heart-beats  was  digitally  recorded  and  these 
times  were  then  converted  to  Beats-Per-Minute  (BPM).  Six  to  ten  of  these  samples 
were  taken  to  represent  each  hour  and  means  and  estimated  standard  deviations  were 
computed  for  each  nour.  Missing  data  prohibited  the  use  of  10  samples  every  hour, 
but  no  sample  less  than  six  was  used. 
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Figure  23  shows  the  means  for  WL  and  Time  of  Day  (TD)  when  the  mean  BPM  for  each 
hour  were  used  as  raw  scores.  The  WL  groups  differ  Initially  and  the  difference 
remains  essentially  constant  across  TP,  Correcting  for  Initial  differences  the  __ 
curves  are  quite  similar  with  peaks  correspondl ng  to  feeding  times. 

Figure  24  shows  heart*rate  variance  where  SDs  were  used  as  raw  scores.  Again 
the  WL  groups  differ  throughout  with  essentially  no  WL  X  TD  Interaction. 

For  the  sleep  period,  heart-rate  scores  were  recorded  and  treated  similar  to 
waking  values  except  that  they  were  not  analyzed  by  hours.  Rather,  they  were 
combined  into  a  single  mean  and  50  for  each  S_. 

When  means  were  used  as  raw  scores,  the  LWL  mean  was  58.0  BPM  and  the  HWL  mean 
was  57.0.  The  F  (1,16)  =  .084  was  not  significant.  An  analysis  of  covariance 
statistically  equating  the  groups  also  failed  to  be  significant,  F ( 1 ,15)  =  .058. 

Using  SC  scores,  tne  mean  SO  for  the  LWL  group  was  8.45  BPM  and  7.44  for  the  HWL. 
This  difference  was  not  significant,  F  (1,15)  =  .970. 


Basal  Skin  Resistance 

Basal  Skin  Resistance  (BSR)  was  taken  in  ohms  resistance  and  converted  to  log 
ohms  resistance.  One  sample  was  taken  each  hour  and  the  WL  by  TD  means  are 
plotted  in  Figure  25.  Resistance  decreases  throughout  the  day,  but  there  is  no 
differential  decrease  for  tne  two  WLs . 

Sleep  BSR  is  shown  in  Figure  26.  Again,  resistance  decreases  across  hours, 
but  not  differentially. 


Respiration  Rate 

Respiration  was  recorded  in  wave  form  on  strip  charts.  Ten  one-minute  samples 
were  taken  for  each  hour  and  a  mean  was  computed.  These  means  were  then  used  as 
raw  scores  and  they  are  summarized  in  Figure  27.  RR  decreased  over  time,  hut 
tne re  was  no  differential  decrease  as  a  function  of  WL. 
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Mean  of  Median  Response  Times  in  Seconds 


Wl  -  F(l.M)  •  1.156  ns 
TP  Blocks  -  F(2,40)  •  2.683  n$ 
Wl  X  TPB  -  F<2.40)  •  1.561  n< 
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Figure  27  . 

Mean  Oral  Temperature  In  degrees  Fahrenheit  as  a  function 
of  Time  of  Oly  end  Wl. 
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Figure  23  Mean  Heart-Pate  In  BPM  as  a  function  of  Wl  and  Tine  of  Day. 

HWl  was  also  adjusted  for  Initial  Heart-Pat*  differences  and 
these  adjusted  values  are  plotted. 

Figure  24  • 

Mean  of  Heart-Pate  Standard  Deviations  Scores  as  a  Function  of 
Wl  and  Time  of  Day.  Standard  Deviations  were  computed  from 
ippro«1mately  ten  samples  for  each  hour. 

Comparison  of  Discrete  Tracking,  Continuous  Tracking,  and  Time  Estimation 

In  order  to  see  the  relationship  among  DT,  CT,  and  Time  Estimation  Error,  they 
are  plotted  together  in  figure  28.  The  CT  HWL  function  was  adjusted  for  easier 
comparison,  The  adjustment  is  the  mean  difference  between  the  LWL  and  HWL  on 
the  first  two  Test-Trial  cycles.  Comparisons  are  also  given  in  Figures  29,  JO, 

31,  and  32,  for  Si,  S2,  Sll,  and  512,  as  examples  of  individual  performance  curves 


TEST  OF  BARTLETT'S  HYPOTHESIS 

Two  variables  allowed  for  a  test  of  Bartlett's  Hypothesis,  Discrete  Tracking, 
and  Size  Estimation.  The  hypothesis  predicts  increased  variance,  perceptual 
break-up,  operator  distress,  and  performance  decrement  -  in  that  order. 


Discrete  Tracking 

rc  determine  whether  performance  became  more  variable  in  the  HWL  condition  in 
successive  test  periods,  the  Standard  Deviation  for  each  Test  Period  for  the  ten 
TOT  scores  eacn  S  produced,  were  used  as  raw  scores. 

The  means  of  these  SD  scores  as  a  function  of  WL  and  Test  Period  are  shown  in 
Figure  33.  There  was  a  perfect  reversal  from  the  prediction.  The  LWL  performance 
became  significantly  more  variable  than  the  HWL.  The  £  ratios  are  shown  on  the 
figure . 

Perceptual  break-up  would  be  reflected  in  the  interaction  between  Horizontal- 
Vertical  and  Test  Period  for  the  HWL.  These  means  are  shown  in  Figure  34,  and 
it  is  apparent  that  the  functions  are  virtually  parallel  indicating  little  or 
no  interaction.  The  A NOVA  Summary  in  Table  I  confirms  this  with  a  WL  X  H-V  X  TP 
interaction  F  ratio  (9,504)  =  .902. 
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Measures  of  operator  distress  were  not  taken  systematically.  Notes  were  made 

sai£f4u:i-fin.t.  t.n  test  this  portion 

of  the  hypothesis . 

Performance  decrement  for  Discrete  Tracking  was  described  above. 


Size  Estimation 

The  size  estimation  scores  allowed  a  test  of  increased  variance  across  work 
periods.  Ten  judgments  of  size*  estimations  were  made  in  each  Test  Period  and 
the  Standard  Deviation  of  the  Absolute  Error  scores  was  computed.  The  means 
of  these  Su  scores  were  presented  in  Figure  20.  There  was  a  slight  trend  for 
variance  to  increase  across  Test  Periods,  slightly  more  pronounced  for  the  HWL, 
but  the  ANOVA  failed  to  confirm  the  trend. 
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OPERATIONAL  DEMONSTRATION  OF  FATIBUE 

iRere  was  ample  evidence  that  some  fatigue  occurred.  The  effect  was  best 
illustrated  by  the  Discrete  Tracking  task  where  the  Low  Work  Load  (LWL)  and 
High  Work  Load  (HWL)  curves  began  to  diverge  sharply  with  the  HWL  showing 
performance  decrement,  One  might  assume  that  the  onset  of  fatigue  was  at  the 
fifth  Test  Period  but  this  is  probably  ?  gross  over-simplification.  Further 
testimony  of  fatigue  was  produced  by  Continuous  Tracking  and  the  Self  Report 
Inventory. 

The  Continuous  Tracking  task  was  used  primarily  as  a  loading  task  to  fatigue 
the  S s .  It  was  analyzed  for  symptoms  of  fatigue  under  the  assumption  that  a 
divergence  of  the  curves  especially  with  a  decrement  for  the  HWL  could  best  be 
explained  by  fatigue.  The  HWL  received  twice  as  much  practice  on  CT  as  the  LWL 
so  that  a  simple  direct  comparison  was  impossible.  Also  the  task  was  self-paced 
placing  only  moderate  demands  on  the  The  findings  lend  general  support  to 
the  DT  data. 

Responses  on  the  Self  Report  Inventory  indicated  that  the  HWL  Ss  perceived 
themselves  as  being  more  fatigued  than  the  LWL  Ss. 

By  some  combined  criteria  it  seems  a  certainty  that  the  HWL  group  was  more 
fatigued  than  the  LWL  group.  But  if  this  is  so.  how  should  the  cognitive  Track- 
to- Intercept  data  be  interpreted? 


Perhaps  the  most  decisive  comparison  in  this  experiment  was  between  Discrete 
Tracking  and  the  Cognitive  TTI  problems.  In  pilot  work  a  simplified  DT  task 
and  the  TTI  were  evaluated.  This  prelininary  investigation  indicated  that 
neither  task  in  the  form  evaluated  was  likely  to  be  a  sensitive  index  of  fatigue. 
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The  difficulty  of  the  DT  task  was  increased  so  that  maximum  effort  was  required 
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complete  miss  such  that  the  lights  were  demanding  a  new  response  before  S  could 

initiate  his  response  to  a  previous  signal.  The  pace  was  rapid  and  the  pressure 
unrelenting.  The  task  was  an  excellent  index  of  fatigue.  The  Cognitive  TT! 
problems  were  of  moderate  difficulty  with  virtually  no  time  pressures.  Alter¬ 
native  cognitive  tasks  were  considered.  Moderate  difficulty  with  temooral  pacing 
seemed  to  be  the  need,  but  it  was  impossible  in  the  constraints  of  this  investl* 
gation  (realism  and  calendar  schedule)  to  effect  an  alternate  task.  Indeed,  this 
may  have  been  fortunate  since  it  served  to  verify  and  emphasize  the  preliminary 
conclusions. 


CORRELATION  OF  FATIGUE 

The  onset  of  fatigue  is  most  certainly  coincident  with  the  first  response.  There 
is  probably  a  brief  period  after  warm-up  when  the  organism  is  best  able  to  respond. 
With  continuation  of  the  task  the  organism  becomes  less  capable  but  this  deteriora¬ 
tion  is  masked  by  the  combination  of  Increased  effort  and  tasks  that  demand  less, 
sometimes  far  less,  than  S's  maximum  effort  and  skill.  Only  a  task  which  makes 
almost  excessive  demands  on  skill  and  effort  will  immediately  reflect  fatigue. 

Other  measures  may  be  partially  sensitive  but  they  will  ’’ndicate  fatigue  only 
after  it  has  become  excessive. 

Given  the  conclusion  that  the  onset  of  fatigue  begins  with  the  first  response, 
specific  psychological  and  physiological  variables  should  reflect  fatigue  changes. 
Initially,  these  variables  would  show  extremely  subtle  deviations  due  to  the 
gradual  onset  of  fatigue.  With  progressive  fatigue  the  correlates  would  show 
larger  and  larger  deviations  from  some  nominal  value.  This  entire  sequence  of 
changes  would  be  continuously  retarded  by  recuperative  mechanisms  (Selye,  1950). 

As  fatigue  cumulates  recuperative  processes  reduce  fatigue  and  then  with  rest 
eliminate  it.  Correlates  of  fatigue  should  reflect  this  entire  cycle.  Ideally, 
the  correlates  would  be  simple  physiological  or  psychological  behavior  for  thp 
applied  setting. 
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Discrete  Tracking  and  T 1  me  Estimation  were  the  best  correlates  of  fatigue.  As 
the  experimental  session  progressed  the  HWL  performance  progressively  deterio¬ 
rated  on  these  tasks.  No  other  potential  indices  of  fatigue  showed  any  promise 
oi  p're  dmc  f  eng  f  a  t  '_gu  e . 

The  DT  task  manifested  all  the  attributes  of  a  potentially  good  fatigue  correlate 
so  it  was  not  particularly  surprising  when  DT  correlated  with  WL  and  produced  a 
significant  WL  by  test  period  interaction.  The  behavioral  processes  that  were 
fatigued  in  the  CT  task  were  presumably  necessary  for  DT  degradation .  After 
successive  loading  periods  the  CT  fatigue  was  reflected  in  DT  performance.  There¬ 
fore,  you  might  predict  from  DT  decrement  that  performance  in  the  less  demanding 
CT  task  was  Peginning  and  would  be  clearly  evident  if  the  work  continued.  This 
was  indeed  tne  case.  The  HWL  and  LWL  DT  curves  oegin  to  diverge  at  the  third  or 
fourth  loading-test  cycle  whereas  the  CT  divergence  gradually  begins  at  the  sixth 
or  seventh  load-test  cycle  (also,  recall  that  the  test  measures  of  a  load-test 
cycle  follow  tne  lead  measures). 

Time  estimation  nas  been  identified  as  a  correlate  in  other  investigations. 
r arpenter,  at  cl.  (1969)  found  significantly  smaller  time  estimation  errors  in 
Low  coaa  conditions  relative  to  High  Load  conditions,  contrary  to  the  present  data. 

Bartlett  (1943),  however,  found  larger  time  estimation  errors  in  conditions 
designed  to  produce  high  fatigue. 

Time  estimation  can  be  measured  with  a  simple  psychological  test.  It  would  be 
iaeal  for  many  applications.  Several  (5  -  6)  successive  estimates  would  be 
necessary  to  produce  a  reliable  measure.  In  most  situations  these  estimates 
could  oe  concurrent  with  on-going  activities  producing  minimal  interruption. 

It  could  be  argued  that  fatigue  was  not  severe  so  that  there  was  little  opportunity 
for  covariance.  While  it  is  true  that  Ss  were  not  worked  to  complete  debilitation, 
verbal  reports  and  behavioral  measures  indicated  at  least  moderate  fatluue.  This 
is  precisely  the  level  of  fatigue  that  would  be  important  to  predict.  Lower  levels 
of  fatigue  would  not  be  signi ficantly  detrimental  for  most  tasks  and  higher  levels 
would  be  self-evident. 
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Some  variables  Reserve  further  consideration  even  though  they  failed  to  be  pro¬ 
ductive  in  this  experiment.  Horizontal  phoria  may  be  an  Index  of  visual  fatigue 
given  tasKs  with  greater  visual  load  or  poor  liqhti1’"'  conditions.  Oral  tempera- 
-teredi  f f  ered  s4gr i  -f-i c a-^-tly  f?efwee n  UWT — and -HWt -  a  t  3:00  A  . M . — Temperature  is— & — — — 
simple  direct  measure  which  has  shown  promise  before  (Carpenter,  et  al.;  1969) 
and  should  receive  further  attention. 

The  recovery  processes  were  not  adequately  sampled  with  the  short  3-to-S-hour 
sleep  period.  Therefore,  variables  likely  to  be  associated  with  recovery  pro¬ 
cesses  should  receive  further  exploration.  They  would  be  Ideal  Indices  of  crew 
readl ness . 

It  now  seems  clear  that  special  tests  must  be  developed  to  ensure  measurement  of 
fatigue  in  operational  settings.  The  tests  must  require  traits  very  similar  to 
those  in  the  operational  setting  but  demand  maximum  skill  with  almost  excessive 
time  pressures,  Approximately  6-10  minutes  would  be  required  to  produce  reliable 
measurement.  Such  tests  should  predict  performance  decrement  in  typical  operational 
tasks  so  that  missions  can  be  terminated  before  the  impairment  actually  occurs. 

Other  simple  psychological  and/or  physiological  correlates  may  be  effective  from 
time-to-time  but  should  not  be  considered  as  primary  fatigue  indices. 


BARTLETT'S  HYPOTHESIS 

The  present  investigation  did  not  provide  for  a  good  test  of  Bartlett's  Hypothesis 
(Bartlett,  1953).  The  perceptual  display  was  not  sufficiently  complex  at  any  one 
time  to  allow  for  perceptual  break-up  and  only  two  measures  were  obtained  which 
would  reflect  increased  variance.  Perhaps  it  would  be  best  simply  to  conclude 
that  Bartlett's  Hypothesis  received  no  support. 

The  significant  reversal  in  DT  variance  demands  some  consideration.  Why  should 
performance  become  more  variable  for  the  LWL  group  and  remain  relatively  stable 
for  tne  HWl  group?  Since  performance  was  superior  In  the  LWL  group,  perhaps  this 
is  just  one  of  those  cases  where  means  and  variance  are  correlated,  contrary  to 
our  typical  statistical  assumptions.  Or  this  may  be  that  "one  in  twenty"  random 
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error  effect  that  we  expect  when  we  use  the  .05  level  of  significance 


L  U  UtJ  MU  P  I  dUb  I  U  I  t:  UlcC'r€uCa  I  t:  Ap  i  ana  u  i  On  (Or  ct  1 1  3  Oywtwnn,’  . 


There 


COMPARISON  WITH  CARPENTER,  ET  AL .  (  1969) 

Carpenter,  et  al.  (1969)  simulated  ASW  missions  of  2-hour  and  8-hour  duration. 

Navy  crews  flew  an  $2  simulator  on  five  successive  days  for  one  duration  period 
and  then  after  a  weekend  break  flew  five  more  missions  of  the  other  duration. 

The  work  environment  was  crowded,  noisy,  poorly  ventilated,  and  poorly  lighted. 

The  tasks  closely  simulated  ASW  tasks.  The  high  WL  conditions  made  only  moderate 
demands  on  the  Ss. 

The  present  study  used  tasks  related  to  ASW  but  the  study  should  not  be  considered 
a  simulation  of  ASW  missions.  Lighting  and  ventilation  were  good.  There  was 
ample  space  with  frequent  opportunities  for  the  Ss  to  move  about  and  stretch. 
Background  noise  was  of  moderate  intensity.  There  was  a  high  work  load  condition 
extending  over  14  hours  producing  extensive  demands  on  the  S s . 

Heart-rate  and  respiration  rate  seemed  to  be  indices  of  fatigue  for  Carpenter  et  al . 
but  not  in  the  present  study.  The  change  in  environmental  conditions,  the  task 
differences,  and/or  successive  versue  one-day  test  periods  could  account  for  the 
discrepancy  between  the  two  studies.  It  may  be  that  the  physiological  parameters 
will  be  sensitive  only  when  there  is  extensive  work  in  poor  environmental  conditions. 

Phcria  measures  may  be  sensitive  with  a  combination  of  high  visual  load  and  poor 
lighting.  Either  adequate  lighting  or  low  visual  load  may  invalidate  phpria  as  a 
fatigue  index. 

he  Ss'  sel ^-reports  of  fatigue  symptoms  were  similar  across  the  two  studies  with 
pernaps  just  the  slightest  increase  in  intensity  of  fatigue  symptoms  in  the  present 
investi gaticn. 
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The  major  difference  between  the  experiments  was  the  high  correlation  of  tracking 
Derformance  and  WL  In  present  Studv  with  no  anna  runt  rolat  Innehln  In  the  nnulniK 

‘  •  -  -  ■  -  ■  —  - -  r  r '  *-  •  - - 

experiment.  Again,  this  seems  to  certify  the  necessity  of  a  moderately  difficult 
^3Tk~wg'th*“cxt'r'eme“t'eimpo>'a1— pYesTuFeT  CaTpeKter,  et  al .  use3''‘pT1^t  ' perfonfiance 
in  a  standard  rate  turn,  a  maneuver  with  low  temporal  stress  requiring  a  moderate 
level  of  skill.  The  performance  decrement  trends  for  this  turn  maneuver  may 
reflect  fatigue  while  at  the  same  time  Indicate  low  sensitivity  for  the  measure. 

In  contrast,  DT  in  the  present  study  reflected  fatigue  and  proved  to  be  an  extremely 
sensitive  index.  The  DT  task  demanded  an  extremely  high  rate  of  information 
processing  per  unit  of  time. 


CONCLUSIONS 


The  HWl  Ss  were  more  fatigued  than  the  LWL  Ss . 

Discrete  Tracking  was  the  best  performance  index  of  fatigue 
but  other  measures  reflected  fatigue. 

Time  estimation  errors  reflected  the  magnitude  of  fatigue. 

Bartlett's  Hypothesis  was  not  supported;  however,  it  may  not 
have  been  adequately  tested. 

Indices  of  fatigue  must  demand  moderate  or  high  level  skills 
with  almost  excessive  time  pressures. 

Simple  physiological  or  psychological  measures  may  indicate 
fatigue  for  some  applications  while  being  insensitive  for 
others. 
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APPENDIX  A 

SUBJECT  SCREENING  AND  BRIEFING 


Subject  # 


SUBJECT  BRIEFING 


1.  Purpose:  To  study  behavior  in  simulated  long  duration  flight  operations. 

2.  The  task  involved  A  not  manual  labor,  but  more  like  desk  work. 

It  goer,  on  ill  day  an.!  it  tales  place  in  a  laboratory. 

3.  You  par. k  ho  p >•  -5. cut  for  a  full  24  hours.  You  cannot,  leave  the 
laboratory  during  that  time. 

4.  Part  of  tiio  24  hours  is  reserved  for  si  con,  which  will  come  at 
the  end  of  the  same  day,  after  the  task  has  been  completed. 

5.  Sleeping  quarters  and  bedding  are  within  the  laboratory.  If  you 
require  noro  then  <1  hours  of  sleep,  you  cannot  count  on  it  for 
this  24  hour  period. 

6.  Toilet  and  shower  facilities  are  available  in  the  laboratory.  You 
will  be  asked  to  provide  your  own  towels.  In  general,  bring  with 
you  whatever  you  normally  require  for  an  over-night  stay. 

7.  rood. (subjects  only)  will  be  supplied,  but  not  at  regular  meal  time. 

The  feed  will  be  in  the  form  cf  snacks  and  sandwiches. 

3.  You  will  be  required  to  wear  seme  electrical  connections  during 
both  the  day  and  the  sleep  period.  There  will  be  no  clanger  of 
electrical  shock.  You  will  also  wear  elastic  belts  around  your 
chest  and  stomach. 

9.  You  cannot  smoke  except  by  permission  at  planned  intervals. 


I  understand  and  accept  the  above  conditions. 

I  understand  that  if  I  refuse  to  complete  the  full 
24  hours  of  work  1  cannot  be  paid. 

Signature 


Address 


Phone  No. 
My  Age  iri  Years 


III  -  49 


GENERAL  MEDICAL 


Is  your  corrected  vision  normal 


Do  you  wear  glasses 


Do  you  now  have  any  disease  or  infection 
that  you  can  report 


Is  your  hearing  good 


Are  you  now  taking  any  medicine 


Do  you  have  any  skin  trouble 


Have  you  ever  had  a  skin  allergy 


Do  you  have  diabetes 


Has  any  of  your  family  had  diabetes 


Havo  you  ever  had  heart  trouble 


Did  you  ever  have  rheumatic  fever 


Has  any  one  ever  said  you  have  epilepsy 


Would  you  object  to  giving  specimens 
of  your  urine  during  the  experiment 


Is  there  anything  important  about  your 
health  not  already  asked.  If  YES, 
state  below. 


[* 


m 

i 

ki¬ 
ll  L» 

Can  you  avoid  use  of  the  toilet  for 
as  long  os  4  hours  during  the 
exnerimon  t 

Are  thoro  foods  you  are  not  allowed 
to  eat.  IF  YES,  name  them. 

Are  you  willing  to  go  without  regular 
meals  during  the  experiment 

.  -  ...  _ 

Do  you  have  any  unusual  food  requirement 

Can  you  arrange  your  affairs  so  that  there 
will  be  no  outside  interruption  during 
the  experiment 


Figure  A1  .  BIO-ELECTRONIC  PACKAGE 


PHYSIOLOGICAL  DATA  RECORDING  SYSTEM 


SWITCH  STLLCT 


EXPERIMENTERS'  CONSOLE  ^iiaJECTS'  CONSOLE 


(VIGILANCE) 
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SAMPLE  COGNITIVE  PROBLEMS 
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LOTTING 


"OGLcMS 


Instructions 


After*  plottirg  the  coordinates  you  are  to  compute  the  distance  traveled, 
speed,  and  compass  headlnq  for  the  following  problems, 

!.  Distance  is  derived  simply  by  measuri no  the  distance  between  two 
points  fie. ,  between  f  i  *  1  and  2).  A  ruler  is  provided  for  this 
purpose.  Ynuwin  be  given  further  instruction. 


i!.  Speed  is  derived  by  dividing  distance  bv  time.  You  get  "time'" 

Dy  subtracting  the  appropriate  times  given  in  your  problem  sheets. 

You  are  to  convert  "time'1  into  seconds.  For  example,  in  the  following 
problem  you  would  do  as  follows: 

t,  *-  07  :  09  :  09 
tf  06:03:04 
1  1  :  ‘6  :  5 


1  hr.  »  3600  sec. 

6  min.  -  360  sec. 

5  sec.  =  5  sec. 

3965  sec. 

Therefore,  "time"  eouals  3965  seconds. 

3600 


To  get  speed  you  simply  divide  the  distance  by  time. 
equa1:-  46  nautical  miles,  then  speed  equals: 


=  46 
--  46 


.  3965 

'•  3600 

360C 
*  3965 


40 


Speed  is  equal  to  40  knots. 


So,  if  distance 


HI.  Comnass  heading  is  obtained  by  plotting  the  coordinates  provided, 
drawing  a  line  between  the  points,  and  measuring  the  heading  with  a 
protractor. 


'to  30° 
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Sample  Problem  and  Its^  Solution 

Given: 

1)  lat.  32°  IT  N,  long.  153°  13'  E  (t  -  01:39:31) 

2)  lat.  32°  15*  N.  long.  153°  31*  E  (t  «  02:30:11) 

Questions: 

A.  What  is  the  distance  from  fix  1  to  fix  2? 

B.  What  is  the  speed  of  the  sub  if  it  is  traveling  from  fix  1  to  fix  2? 

C.  What  is  the  compass  headinq? 

Solution 

A.  Distance  equals  19  nautical  miles.  This  was  derived  simply  by 
measuring  the  distance  between  the  two  fixes. 

3.  Speed 

(T)  01  :  39  :  71  ©  50  3000 

-01  :  39  :  31  60  *0 

50  <10  3000  3040 


68400 

3040 


©  63400 
3040 

Speed 

C.  You  will  be  shown  how  compass  headings  are  read. 


*  22.50 

=  22.50  knots 


Note: 

1 )  t  -  time 

2)  A  "fix"  is  a  point  that  you  have  plotted. 
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All  problems  that  Involve  air  speed: 
1.  The  effect  of  altitude 


Light  plane  airspeed  Indicators  are  calibrated  for  standard  sea 
level  operation  (59°  F  -29.92"  Hg).  As  you  go  to  higher  alti¬ 
tudes,  the  air  will  be  less  dense  and  therefore  there  will  be 
less  drag.  The  plane  will  move  faster,  but  because  of  the  lesser 
density,  the  airspeed  will  register  less  than  your  actual  speed. 

This  error  can  be  corrected  roughly  by  the  rule  of  thumb,  "add 
2  per  cent  per  thousand  feet".  This  means  that  If  your  "calibrated" 
airspeed  Is  100  mph  at  5,000  feet,  your  "true"  airspeed  Is 

5  x  2  *  +10  per  cent  •  (.10)  (100)  *  lOmph  +  lOOmph  *  HOmph. 

/ 

,  / 

2.  The  effect  of  wind 

Ground  speed  Is  like  walking  on  an  escalator.  If  you're  walking 
against  the  escalator  at  Its  exact  speed,  you  make  no  progress. 

By  walking  with  the  escalator  your  speed  Is  added  to  Its  speed. 


Air  mass  (wind)  speed  -20  mph 

i'  i’  ill 


calibrated  speed  *  100  mph 


Ground  speed  =  100  -  20  =  80  mph 


calibrated  =  100  mph 


A  |  a  t  t- 

Air  mass  (wind)  speed  -  20  mph 
Ground  speed  =  100  +  20  =  120  mph 
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EXAMPLES: 


1 .  Altitude 

At  what  altitude  must  a  plane  be  flying  in  order  for  it  to  take  3  hours 
to  fly  450  nautical  miles  if  it  had  a  25  knot  tail  wind  and  was  flying 
at  95  knots? 

Solution: 

a.  air  speed  =95+25  120  knots  (speed  pi u •  wind  factor) 

b.  land  speed  =  45(1  4 ■  3  -  150  knots  (distance  to  be  flown  divided  by  time) 

c.  correction  factor  per  thousand  feet  =  .02  x  120  =  2.40  (correction  factor 
times  air  speed) 

d.  150  -  120  =  30  (land  speed  subtracted  by  air  speed) 

e.  altitude  =  30  4-  2.40  =  12.500  feet  (difference  between  land  speed  and 
air  speed  divided  by  correction  factor) 

2.  Time 

Flying  at  6.r000  feet,  with  a  20  knot  tail  wind,  at  100  k  how  long  will  it 
take  to  go  500  nautical  miles? 

Solution: 

a.  2  x  6  =  .12  x  100  =  112K  (correction  factor  for  altitude) 

b.  112  +  20  =  132K  =  (correction  factor  for  altitude) 

c.  time  =  500  4-  132  =  3.79  (distance  divided  by  speed) 

3.  Distance 

A  plane  is  traveling  at  100  knots  at  5,000  feet,  with  a  head  wind  of  20  knots. 
How  many  nautical  miles  can  it  travel  in  2  hours? 

Solution: 

a.  5  x  2  =  .10  x  100  =  100  +  10  =  110  (correction  factor  for  altitude) 

b.  110  +  20  =  130  (correction  factor  for  wind) 

c.  130  x  2  =  260  nautical  miles  (distance  that  can  be  flown) 
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FUEL  CONSUMPTION 
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INSTRUCTIONS 


To  determine  the  amount  of  fuel  consumed,  follow  these  steps: 

1.  Convert  speed  into  true  air  speed  as  you  did  in  the  estimated  time  of 
- p rob  Terns-  .  'Remember":  you  taVs  2%  of- 5veTy'“6ne  tl  Vou s  "an id  T s  e  L 

tlie  plane  is  into  the  atmosphere.  Then  you  multiply  this  amount  by 
the  speed,  and  add  the  amount  to  the  speed.  Finally,  you  add  or  sub¬ 
tract  the  wind  factor,  This  value  Is  your  true  air  speed.  (See 
instructions  for  ETA  problems.) 

2.  Next  you  divide  the  distance  flown  by  the  true  air  speed  to  pet  the 
duration  of  the  fl ight. 

3.  Finally,  you  multiply  the  value  for  flight  duration  by  the  value  for 
the  number  of  gallons  consumed  per  hour!  This  latter  value  can  be 
found  by  looking  at  Table  1  and  finding  the  power  rating  given  in  the 
problem.  You  will  use  the  accompanying  number  for  the  multiplication. 

4.  The  product  is  your  answer. 
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TABLE  1 


'■  of  Power  Rati nq 

Gallons  of  FueV  Cons uned  Per/Hr. 

66* 

12.8  ' 

90?. 

19.1 

60? 

12.0 

50? 

9.9 

96" 

22.6 

45" 

8.9 
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T.T.I, 

PROBLEMS 
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Formulas  for  T.T.i.  P  rub  loins 


distance 

speed 


20  n.m. 
10"  k". 


Ml'S  . 


^stance _ 


.15  n.m. 


r. !  me 


3  hrs . 


-5— k-r- 


3. 


Frio  value  of  the  hypotenuse  is  equal  to  tne  square  root  of  the 
the  squarred  legs  of  the  triangle.  That  is,  in  the  triangle 
c  ( tne  nypotenuse)  equals  r-^ - 7f~ 


sum  of 


I) 


c 


nTTjoT 


C  V  500 


-  •  •  *  tne  value  of  one  leg  and  the  Hypotenuse  are  known,  the  value  of  tne 

other  leg  may  oe  found  by:  1.  squaring  tne  value  of  the  known  leg  and 
squaring  tne  Hypotenuse;  2.  subtracting  the  squared  value  of  the  leg 
from  the  value  c-f  the  squared  hyootenuse;  3.  Then  taking  the  square  root 

of  the  ootained  value.  That  is  ( — s - = —  j-= - - — 

a  =  'ic-bd  or  b  =  Vcd  - 


a  =  91309  -  90,000 


a  =  n/  1809 
a  =  43 
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Formulas  for  T.T.I.  Problems  (continued) 


For  the  first  two  parts  (A  and  B)  of  the  question  you  will  use 
listed  under1  'given".  For  the  third  part  (C)  you  will  use  the 
provided  after  the  question. 


the  information 
information 


APPENDIX  D 


SPECIFIC  TASK  INSTRUCTIONS 


III  -  70 


SPECIFIC  TASK  INSTRUCTIONS 


Instructions:  Continuous  Tracking 

This  is  a  continuous  tracking  task.  It  is  your  job  to  keep  the  green  pip  in  the 
exact  center  of  the  screen,  which  is  designated  by  the  vertical  line.  The  pip 
will  move  randomly  but  by  moving  the  "joy-stick1'  in  the  direction  you  want  the 
pip  to  go,  you  will  be  able  to  control  its  fluctuations.  You  are  to  use  only 
one  hand  while  manipulating  the  stick.  Use  the  same  hand  for  all  trials. 

The  closer  the  pip  is  to  the  center  and  the  longer  you  keep  it  that  way,  the 
more  credit  you  will  be  given. 


Instructions:  Ortho-rater  (Phoria) 

You  are  to  look  at  four  targets,  two  at  close  range  and  two  at  a  more  distant 
range.  The  two  displays  seen  at  close  range  will  be  the  same  as  the  two  seen 
at  the  more  distant  range. 

You  adjust  for  the  two  cToser  displays  first  by  pulling  toward  you  where  it  says 
ortho-rater".  Then  you  move  the  adjustor  on  the  right  side  of  the  ortho-rater 
so  that  it  is  spaced  between  the  1  and  2.  This  is  the  setting  for  the  vertical 
display.  After  you  adjust  to  this  setting,  by  looking  into  the  lens  you  will 
see  that  there  is  a  step-like  ladder  with  a  number  designating  each  stap  (range, 

1  to  9).  Also  there  is  a  discrete  horizontal  line  that  is  formed  by  a  number  of 

red  dots.  Your  task  is  to  determine  the  step  level  that  the  horizontal  line  runs 

across .  You  record  the  number  that  corresponds  between  the  numbers  2  and  3. 

This  is  the  setting  for  the  horizontal  display.  After  you  adjust  to  this  setting, 
by  looking  into  the  lens  you  will  see  that  there  are  some  horizontally-rpaced 
white  dots  with  a  number  under  each  of  them  (range,  1  to  15).  Laterally,  there 
is  an  arrow  with  black  dots  within  it.  You  are  to  determine  where  Jdao  arrow  inter- 
cepts  a  dot.  You  are  to  record  on  your  data  sheet  the  number  that  is  below  the 

dot  that  you  think  the  arrow  intercepts. 
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After  finishing  that,  move  the  adjustor  back,  to  its  previous  setting  between 
1  and  2.  Then  push  forward  where  it  says  "ortho-rater"  and  now  you  are  ready 
to  work  with  the  more  distant  display. 

First,  move  the  adjustor  farthest  from  you  (on  the  right  side  of  the  ortho-rater) 
to  the  number  5.  This  is  the  setting  for  the  vertical  display.  You  then  follow 
the  same  procedure  as  in  the  vertical  display  task  above. 

Next,  move  the  farthest  adjustor  to  the  number  6.  This  is  the  setting  for  the 
horizontal  display.  Then  follow  the  same  procedure  as  in  the  horizontal  display 
task  above. 

Finally,  move  the  setting  back  to  the  number  5,  then  go  on  to  the  next  station 
(i.e.,  the  Size  Constancy  station). 


Instructions:  Combined  Hand  Steadiness  and  Time  Perception 

In  this  task,  you  will  do  two  things.  You  are  required  to  make  an  estimate  of  a 
prescribed  time  interval;  also,  during  that  estimation  period  you  are  to  keep 
the  stylus  from  touching  the  sides  of  the  circle,  as  well  as  the  back-up  plate. 

Follow  carefully  these  procedures: 

1.  Select  the  appropriate  time  interval  to  be  estimated. 

2.  To  reset  the  timer  to  zero,  push  the  reset  buttons  in  the 
bottom  right  corner  of  the  bottom  apparatus. 

3.  To  reset  the  numbers  on  the  top  apparatus  to  zero,  push 
the  reset  button  in  the  bottom  right  corner  of  the  top 
apparatus . 

4.  With  your  dominant  arm,  take  the  stylus  into  your  hand 
with  your  palms  faced  down.  You  canr.ot  hold  the  stylus 
beyond  the  plastic  coverinq.  Do  not  place  your  arm  nn 
the  table  while  doing  the  task. 


111  -  7r 


5.  Take  the  plastic  box  into  your  free  hand  and  place  a  finger 
on  the  switch.  This  operates  both  timers. 

6.  _ Make  sure  you  are  standing  that  you  cannot  see  the  numbers 

of  the  two  apparatus. 

7.  When  you  are  ready,  extend  your  am  and  keep  the  stylus 
from  touching  the  sides.  Then  flip  on  the  switch  on  the 
plastic  box.  Bo  sure  to  keep  the  stylus  from  touching 
the  sides  and  oack  plate  as  much  as  possible.  The 

more  you  touch  these,  the  poorer  one  of  your  scores 
will  be. 


3.  As  soon  as  you  turn  on  the  switch,  begin  counting  aloud 
backwards  -  from  100  by  threes.  When  you  have  judged 
that  the  prescribed  interval  has  elapsed,  switch  off 
the  plastic  box.  You  can  then  release  the  stylus. 

9.  Look  around  and  record  in  the  appropriate  spaces  on 
your  data  sheet  the  actual  elapse  time  (from  bottom 
apparatus)  and  the  time  touching  the  plate  (from  top 
apparatus) . 


Reset  both  series  of  numbers  and  move  on  to  the  next  station. 


Instructions:  Size  Estimation 

In  this  task,  you  are  to  estimate  line  length. 

First,  move  the  t’ack  switch  from  the  "OFF"  position  up  two  notches  to  where  it 
says  "LOW".  This  turns  on  the  projector  and  projects  a  black  line  onto  the  screen 
before  you.  You  will  notice  a  scale  to  the  right  of  the  projector  with  numbers 
on  it,  ranging  from  1  to  52.  This  scale  is  in  centimeters.  Your  task  is  to  deter¬ 
mine  the  lengtn  of  the  projected  line  in  centimeters,  using  this  scale.  You  are 
to  estimate  the  line  to  the  nearest  centimeter.  When  you  have  done  this,  record 
on  your  data  sheet  the  number  which  corresponds  to  the  estimated  line  length  in 
centimeters . 
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On  your  data  sheet  you  will  notice  that  there  are  10  spaces  in  each  row.  During 
this  task  you  will  estimate  10  lines,  using  the  procedure  just  described.  There 
will  be  only  one  line  on  each  slide.  Therefore,  you  go  through  a  series  of  10 
slides,  To  do  this,  you  must  push  the  grey  button  marked  "FOR"  on  the  black  box 
oeside  the  projector.  By  pushing  the  button  once  you  will  advance  forward  one 
slide.  By  holding  the  button  down  you  can  advance  forward  more  than  one  slide, 

If  you  want  to  go  back  and  look  at  a  slide,  push  the  button  marked  "REV".  This 
button  reverses  the  slide  advancement.  The  longer  you  hold  this  button  down,  the 
farther  back  in  the  series  of  slides  you  will  go. 

On  your  data  sheet  there  is  a  column,  "slide  number  to  start  on".  You  will  always 
start  on  that  number.  On  the  slide  carousel  there  are  numbers  ranging  from 
0  to  80.  Using  the  start  number  on  your  data  sheet,  you  will  line  the  correspond¬ 
ing  number  on  carousel  up  with  the  notch  on  the  side  of  the  projector.  To  move 
the  carousel,  push  and  hold  the  grey  button  marked  "SELECT"  on  the  side  of  the 
projector  down.  While  this  button  is  down,  you  can  move  the  carousel  to  any  num¬ 
ber  that  you  want. 

After  you  have  completed  this  task  (estimated  the  length  of  10  lines),  turn  off 
the  projector.  To  do  this,  move  the  black  switch  on  the  projector  from  the  "LOW" 
position  to  the  "OFF"  position,  then  go  on  to  the  next  station  (Limb  Steadiness 
and  Time  Perception). 


(Security 


DOCUMENT  CONTROL  DATA 

if  ii fir,  1 1 / "  .if  nh<ir,:ct  m uf  imlrxinj  .uwotuli-  ii  :r.ii 


-  R  &  D 
be  emeu  • 


h. 'it  I  hr  f ;  r  /  /i  cLii'.ifii'rij 


1.  OftiGI  NATiNCi  AC Tl  VI TV  /Oir/jui-dU-  authurj 

Douglas  Aircraft  Company 
McDonnell  Douglas  Corporation 


2.1.  REPORT  GF.CUniTY  GLAliSIF  I  CAT  I  ON 

Unclassified 


2D.GnOUi’ 


3.  fienoltT  1  ;TLE 

i — Huittan  K-rror-Research  and  Analyiis-  Program,  -Summary— and  F4  nal— Report  - 


4.  DESCRIPTIVE  NOTES  (Type  of  tepurt  .<nd  hiiltisii-c  dali‘1) 

Summarizes  work  of  four  contracts  from  August  1965  to  April  1970 


P  ijTl  IC fUf.)  tf'Uu  name,  middle  Initial,  iJSt  namvj 

Lyle  R.  Creamer 
Douglas  E.  Wheeler 


John  S.  Tarr 
Jack  A.  Baldwin 


1  Richard  F.  Gabriel  Hojat  J.  Rostami  1 

6.  ru-.ro  ft t  dat  ?: 

April  1970 

7a.  TOTAL  NO.  OF  PAGES 

150 

7b.  NO.  OF  REFS 

36 

fla.  CONTRACT  Oft  GRANT  NO. 

N00189-70-C-1 321 

b.  PROJECT  NO. 

N001 88-8290-21 31 

c. 

U. 

9a.  ORIGIN ATOftT.  REPORT  NUMBKfi(S) 

MDC  J0698/01 

9b.  OTHER  REPORT  NO(S)  (Any  other  numbers  that  may  he 
assigned  this  report} 

IC-.DISTRIBLn  ION  STATE^t-.M 

All  distribution  of  this  report  is  controlled.  Qualified  DDC  users  shall  request 
through  'J.S.  Naval  Safety  Center,  U.S.  Naval  Air  Station,  Norfolk,  Virginia. 

1  S .  EUl’I’LtV.r  N  i'AHY  NO'lfcS 

None 

12.  SPC.  jRINO  Mll.l  fARY  ACTfVfi  V 

U.S.  Naval  Safety  Center 

U.S.  Naval  Air  Station 

Norfolk,  Virginia 

)  3.  A l) ST  PACT 

Because  of  the  high  ratio  of  human  error  induced  to  total  Naval  aviation  accidents1 
the  U.S.  ’.'aval  Safety  Center,  Norfolk,  Virginia,  in  1  965  funded  the  first  of 
four  contracts  to  study  l! human  error"  under  the  heading  Human  Error  Research 
and  Analysis  Program  (HERAP). 

Data  analysts  have  developed  a  data  bank  for  records  of  17,000  Navy  pilots 
(personal  data,  training  records  and  individual  flight  time)  and  accident  data. 
Statistical  techniques  were  applied  to  the  data  in  an  attempt  to  find  predictive 
measures  for  human  error  accidents.  Some  success  was  achieved. 

Two  studies  were  conducted  to  provide  additional  insight  into  the  recognition, 
measurement,  prediction  and  alleviation  of  fatigue  "/hen  defined  as  "The 
degradation  in  performance  or  affective  state,  resulting  from  previous  work." 

In  the  first  study  two-nan  crews  were  monitored  continuously  for  two  weeks 
during  high  and  low  work  periods  in  an  ASW  S-2  simulator.  Some  indicators  were 
found  but  longer  work  periods  were  needed  to  insure  fatigue. 

In  the  second  study  two-man  crews  were  run  through  24  hour  cycles  of  high  and 
low  work  loads  of  up  to  18  hours  duration.  Performance  decrement  was  obtained 
and.  one  measure  was  sensitive  to  fatigue. 


DD, 


PCfiM 

nov  c a 


,1473 


Unci assi fied 


Security  Clarification 


Unci  ass  1 f i ed 

Sc  ;M.if  1 1  y  Cla  sM  fi  c.  s  t  ior* 


Security  Classification 


